Binary Matrix Factorisation via Column Generation

Abstract

Identifying discrete patterns in binary data is an important dimensionality reduction tool in machine learning and data mining. In this paper, we consider the problem of low-rank binary matrix factorisation (BMF) under Boolean arithmetic. Due to the hardness of this problem, most previous attempts rely on heuristic techniques. We formulate the problem as a mixed integer linear program and use a large scale optimisation technique of column generation to solve it without the need of heuristic pattern mining. Our approach focuses on accuracy and on the provision of optimality guarantees. Experimental results on real world datasets demonstrate that our proposed method is effective at producing highly accurate factorisations and improves on the previously available best known results for 15 out of 24 problem instances.

Cite

Text

Kovács et al. "Binary Matrix Factorisation via Column Generation." AAAI Conference on Artificial Intelligence, 2021. doi:10.1609/AAAI.V35I5.16500

Markdown

[Kovács et al. "Binary Matrix Factorisation via Column Generation." AAAI Conference on Artificial Intelligence, 2021.](https://mlanthology.org/aaai/2021/kovacs2021aaai-binary/) doi:10.1609/AAAI.V35I5.16500

BibTeX

@inproceedings{kovacs2021aaai-binary,
  title     = {{Binary Matrix Factorisation via Column Generation}},
  author    = {Kovács, Réka Á. and Günlük, Oktay and Hauser, Raphael A.},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2021},
  pages     = {3823-3831},
  doi       = {10.1609/AAAI.V35I5.16500},
  url       = {https://mlanthology.org/aaai/2021/kovacs2021aaai-binary/}
}