A Unified Framework for Model-Based Clustering

Abstract

Model-based clustering techniques have been widely used and have shown promising results in many applications involving complex data. This paper presents a unified framework for probabilistic model-based clustering based on a bipartite graph view of data and models that highlights the commonalities and differences among existing model-based clustering algorithms. In this view, clusters are represented as probabilistic models in a model space that is conceptually separate from the data space. For partitional clustering, the view is conceptually similar to the Expectation-Maximization (EM) algorithm. For hierarchical clustering, the graph-based view helps to visualize critical/important distinctions between similarity-based approaches and model-based approaches. The framework also suggests several useful variations of existing clustering algorithms. Two new variations---balanced model-based clustering and hybrid model-based clustering---are discussed and empirically evaluated on a variety of data types.

Cite

Text

Zhong and Ghosh. "A Unified Framework for Model-Based Clustering." Journal of Machine Learning Research, 2003.

Markdown

[Zhong and Ghosh. "A Unified Framework for Model-Based Clustering." Journal of Machine Learning Research, 2003.](https://mlanthology.org/jmlr/2003/zhong2003jmlr-unified/)

BibTeX

@article{zhong2003jmlr-unified,
  title     = {{A Unified Framework for Model-Based Clustering}},
  author    = {Zhong, Shi and Ghosh, Joydeep},
  journal   = {Journal of Machine Learning Research},
  year      = {2003},
  pages     = {1001-1037},
  volume    = {4},
  url       = {https://mlanthology.org/jmlr/2003/zhong2003jmlr-unified/}
}