PROXIMUS: Software for Summarization of Very High Dimensional Discrete-Valued Datasets

PROXIMUS is a software tool for error-bounded approximation of high-dimensional binary-attributed datasets, based on nonorthogonal decomposition of binary matrices. It can be used to analyze data arising in a variety of domains, from commercial to scientific applications. Using a combination of innovative algorithms, novel data structures, and an efficient implementation, PROXIMUS computes a concise representation of very large binary matrices, providing insight into common patterns in the rows and columns of the matrix. PROXIMUS has found application in many areas, including association rule mining, DNA microarray analysis, and business analytics. The original release of PROXIMUS is implemented in C and is freely available as open source below. It has also been implemented in R, within the CBA (Clustering for Business Analytics) package by Christian Buchta and Michael Hahsler, and in Java by Jaan Ubi.
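
To give a feel for what approximating a binary matrix by binary vectors can look like, here is a minimal Python sketch. It is not the PROXIMUS source code and does not use its API. It assumes a simple alternating heuristic that approximates a 0/1 matrix A by the outer product of a binary "presence" vector x (which rows follow the pattern) and a binary "pattern" vector y (which columns the pattern covers), and it measures error as the number of mismatched entries. The function names, the choice of starting vector, and the iteration cap are all hypothetical choices made for this illustration.

```python
def rank_one_binary_approx(A, max_iter=50):
    """Illustrative alternating heuristic: approximate the 0/1 matrix A
    by the outer product of binary vectors x (rows) and y (columns)."""
    m, n = len(A), len(A[0])
    # Assumed initialization: start from the first densest row as the pattern.
    y = list(max(A, key=sum))
    x = [0] * m
    for _ in range(max_iter):
        # With y fixed, setting x[i] = 1 lowers the mismatch count only if
        # row i overlaps more than half of the ones in y.
        y_ones = sum(y)
        new_x = [
            1 if 2 * sum(a * b for a, b in zip(row, y)) > y_ones else 0
            for row in A
        ]
        # With x fixed, setting y[j] = 1 helps only if column j overlaps
        # more than half of the selected rows.
        x_ones = sum(new_x)
        new_y = [
            1 if 2 * sum(A[i][j] * new_x[i] for i in range(m)) > x_ones else 0
            for j in range(n)
        ]
        if new_x == x and new_y == y:
            break  # neither vector changed, so the iteration has converged
        x, y = new_x, new_y
    return x, y


def hamming_error(A, x, y):
    """Number of entries where A differs from the outer product of x and y."""
    return sum(
        abs(A[i][j] - x[i] * y[j])
        for i in range(len(A))
        for j in range(len(A[0]))
    )


# Small example: three rows share a pattern on the first two columns.
A = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [1, 1, 1, 0],
    [0, 0, 0, 1],
]
x, y = rank_one_binary_approx(A)
error = hamming_error(A, x, y)
```

In this example the sketch selects the first three rows and the first two columns, leaving two mismatched entries. An error-bounded scheme could then split the rows according to x and repeat the process on each part until the mismatch count falls below a chosen threshold; that recursion is left out here to keep the sketch short.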

Download

Publications