Cancer: Another Algorithm for Subtropical Matrix Factorization

Abstract

Subtropical algebra is a semi-ring over the nonnegative real numbers with standard multiplication and the addition defined as the maximum operator. Factorizing a matrix over the subtropical algebra gives us a representation of the original matrix with element-wise maximum over a collection of nonnegative rank-1 matrices. Such structure can be compared to the well-known Nonnegative Matrix Factorization (NMF) that gives an element-wise sum over a collection of nonnegative rank-1 matrices. Using the maximum instead of sum changes the ‘parts-of-whole’ interpretation of NMF to ‘winner-takes-it-all’ interpretation. We recently introduced an algorithm for subtropical matrix factorization, called Capricorn , that was designed to work on discrete-valued data with discrete noise [Karaev & Miettinen, SDM ’16]. In this paper we present another algorithm, called Cancer , that is designed to work over continuous-valued data with continuous noise – arguably, the more common case. We show that Cancer is capable of finding sparse factors with excellent reconstruction error, being better than either Capricorn , NMF, or SVD in continuous subtropical data. We also show that the winner-takes-it-all interpretation is usable in many real-world scenarios and lets us find structure that is different, and often easier to interpret, than what is found by NMF.

Cite

Text

Karaev and Miettinen. "Cancer: Another Algorithm for Subtropical Matrix Factorization." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2016. doi:10.1007/978-3-319-46227-1_36

Markdown

[Karaev and Miettinen. "Cancer: Another Algorithm for Subtropical Matrix Factorization." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2016.](https://mlanthology.org/ecmlpkdd/2016/karaev2016ecmlpkdd-cancer/) doi:10.1007/978-3-319-46227-1_36

BibTeX

@inproceedings{karaev2016ecmlpkdd-cancer,
  title     = {{Cancer: Another Algorithm for Subtropical Matrix Factorization}},
  author    = {Karaev, Sanjar and Miettinen, Pauli},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2016},
  pages     = {576-592},
  doi       = {10.1007/978-3-319-46227-1_36},
  url       = {https://mlanthology.org/ecmlpkdd/2016/karaev2016ecmlpkdd-cancer/}
}