A Unified Framework for Model-Based Clustering
Abstract
Model-based clustering techniques have been widely used and have shown promising results in many applications involving complex data. This paper presents a unified framework for probabilistic model-based clustering based on a bipartite graph view of data and models that highlights the commonalities and differences among existing model-based clustering algorithms. In this view, clusters are represented as probabilistic models in a model space that is conceptually separate from the data space. For partitional clustering, the view is conceptually similar to the Expectation-Maximization (EM) algorithm. For hierarchical clustering, the graph-based view helps to visualize critical/important distinctions between similarity-based approaches and model-based approaches. The framework also suggests several useful variations of existing clustering algorithms. Two new variations---balanced model-based clustering and hybrid model-based clustering---are discussed and empirically evaluated on a variety of data types.
Cite
Text
Zhong and Ghosh. "A Unified Framework for Model-Based Clustering." Journal of Machine Learning Research, 2003.Markdown
[Zhong and Ghosh. "A Unified Framework for Model-Based Clustering." Journal of Machine Learning Research, 2003.](https://mlanthology.org/jmlr/2003/zhong2003jmlr-unified/)BibTeX
@article{zhong2003jmlr-unified,
title = {{A Unified Framework for Model-Based Clustering}},
author = {Zhong, Shi and Ghosh, Joydeep},
journal = {Journal of Machine Learning Research},
year = {2003},
pages = {1001-1037},
volume = {4},
url = {https://mlanthology.org/jmlr/2003/zhong2003jmlr-unified/}
}