Hierarchical Convex NMF for Clustering Massive Data

Abstract

We present an extension of convex-hull non-negative matrix factorization (CH-NMF) which was recently proposed as a large scale variant of convex non-negative matrix factorization or Archetypal Analysis. CHNMF factorizes a non-negative data matrix $V$ into two non-negative matrix factors $V \approx WH$ such that the columns of $W$ are convex combinations of certain data points so that they are readily interpretable to data analysts. There is, however, no free lunch: imposing convexity constraints on W typically prevents adaptation to intrinsic, low dimensional structures in the data. Alas, in cases where the data is distributed in a non-convex manner or consists of mixtures of lower dimensional convex distributions, the cluster representatives obtained from CH-NMF will be less meaningful. In this paper, we present a hierarchical CH-NMF that automatically adapts to internal structures of a dataset, hence it yields meaningful and interpretable clusters for non-convex datasets. This is also confirmed by our extensive evaluation on DBLP publication records of $760,000$ authors, $4,000,000$ images harvested from the web, and $150,000,000$ votes on World of Warcraft guilds.

Cite

Text

Kersting et al. "Hierarchical Convex NMF for Clustering Massive Data." Proceedings of 2nd Asian Conference on Machine Learning, 2010.

Markdown

[Kersting et al. "Hierarchical Convex NMF for Clustering Massive Data." Proceedings of 2nd Asian Conference on Machine Learning, 2010.](https://mlanthology.org/acml/2010/kersting2010acml-hierarchical/)

BibTeX

@inproceedings{kersting2010acml-hierarchical,
  title     = {{Hierarchical Convex NMF for Clustering Massive Data}},
  author    = {Kersting, Kristian and Wahabzada, Mirwaes and Thurau, Christian and Bauckhage, Christian},
  booktitle = {Proceedings of 2nd Asian Conference on Machine Learning},
  year      = {2010},
  pages     = {253-268},
  volume    = {13},
  url       = {https://mlanthology.org/acml/2010/kersting2010acml-hierarchical/}
}