SoF: Soft-Cluster Matrix Factorization for Probabilistic Clustering
Abstract
We propose SoF (Soft-cluster matrix Factorization), a probabilistic clustering algorithm which softly assigns each data point into clusters. Unlike model-based clustering algorithms, SoF does not make assumptions about the data density distribution. Instead, we take an axiomatic approach to define 4 properties that the probability of co-clustered pairs of points should satisfy. Based on the properties, SoF utilizes a distance measure between pairs of points to induce the conditional co-cluster probabilities. The objective function in our framework establishes an important connection between probabilistic clustering and constrained symmetric Nonnegative Matrix Factorization (NMF), hence providing a theoretical interpretation for NMF-based clustering algorithms. To optimize the objective, we derive a sequential minimization algorithm using a penalty method. Experimental results on both synthetic and real-world datasets show that SoF significantly outperforms previous NMF-based algorithms and that it is able to detect non-convex patterns as well as cluster boundaries.
Cite
Text
Zhao et al. "SoF: Soft-Cluster Matrix Factorization for Probabilistic Clustering." AAAI Conference on Artificial Intelligence, 2015. doi:10.1609/AAAI.V29I1.9594Markdown
[Zhao et al. "SoF: Soft-Cluster Matrix Factorization for Probabilistic Clustering." AAAI Conference on Artificial Intelligence, 2015.](https://mlanthology.org/aaai/2015/zhao2015aaai-sof/) doi:10.1609/AAAI.V29I1.9594BibTeX
@inproceedings{zhao2015aaai-sof,
title = {{SoF: Soft-Cluster Matrix Factorization for Probabilistic Clustering}},
author = {Zhao, Han and Poupart, Pascal and Zhang, Yongfeng and Lysy, Martin},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2015},
pages = {3188-3195},
doi = {10.1609/AAAI.V29I1.9594},
url = {https://mlanthology.org/aaai/2015/zhao2015aaai-sof/}
}