SoF: Soft-Cluster Matrix Factorization for Probabilistic Clustering

Abstract

We propose SoF (Soft-cluster matrix Factorization), a probabilistic clustering algorithm which softly assigns each data point into clusters. Unlike model-based clustering algorithms, SoF does not make assumptions about the data density distribution. Instead, we take an axiomatic approach to define 4 properties that the probability of co-clustered pairs of points should satisfy. Based on the properties, SoF utilizes a distance measure between pairs of points to induce the conditional co-cluster probabilities. The objective function in our framework establishes an important connection between probabilistic clustering and constrained symmetric Nonnegative Matrix Factorization (NMF), hence providing a theoretical interpretation for NMF-based clustering algorithms. To optimize the objective, we derive a sequential minimization algorithm using a penalty method. Experimental results on both synthetic and real-world datasets show that SoF significantly outperforms previous NMF-based algorithms and that it is able to detect non-convex patterns as well as cluster boundaries.

Cite

Text

Zhao et al. "SoF: Soft-Cluster Matrix Factorization for Probabilistic Clustering." AAAI Conference on Artificial Intelligence, 2015. doi:10.1609/AAAI.V29I1.9594

Markdown

[Zhao et al. "SoF: Soft-Cluster Matrix Factorization for Probabilistic Clustering." AAAI Conference on Artificial Intelligence, 2015.](https://mlanthology.org/aaai/2015/zhao2015aaai-sof/) doi:10.1609/AAAI.V29I1.9594

BibTeX

@inproceedings{zhao2015aaai-sof,
  title     = {{SoF: Soft-Cluster Matrix Factorization for Probabilistic Clustering}},
  author    = {Zhao, Han and Poupart, Pascal and Zhang, Yongfeng and Lysy, Martin},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2015},
  pages     = {3188-3195},
  doi       = {10.1609/AAAI.V29I1.9594},
  url       = {https://mlanthology.org/aaai/2015/zhao2015aaai-sof/}
}