A Split-Merge Framework for Comparing Clusterings

Xiang, Qiaoliang; Mao, Qi; Chai, Kian Ming Adam; Chieu, Hai Leong; Tsang, Ivor W.; Zhao, Zhendong

doi:10.32657/10356/55194

A Split-Merge Framework for Comparing Clusterings

Qiaoliang Xiang, Qi Mao, Kian Ming Adam Chai, Hai Leong Chieu, Ivor W. Tsang, Zhendong Zhao

ICML 2012

doi:10.32657/10356/55194 /icml/2012/xiang2012icml-split/

Abstract

Clustering evaluation measures are frequently used to evaluate the performance of algorithms. However, most measures are not properly normalized and ignore some information in the inherent structure of clusterings. We model the relation between two clusterings as a bipartite graph and propose a general component-based decomposition formula based on the components of the graph. Most existing measures are examples of this formula. In order to satisfy consistency in the component, we further propose a split-merge framework for comparing clusterings of different data sets. Our framework gives measures that are conditionally normalized, and it can make use of data point information, such as feature vectors and pairwise distances. We use an entropy-based instance of the framework and a coreference resolution data set to demonstrate empirically the utility of our framework over other measures.

PDF Semantic Scholar

Cite

Text

Xiang et al. "A Split-Merge Framework for Comparing Clusterings." International Conference on Machine Learning, 2012. doi:10.32657/10356/55194

Markdown

[Xiang et al. "A Split-Merge Framework for Comparing Clusterings." International Conference on Machine Learning, 2012.](https://mlanthology.org/icml/2012/xiang2012icml-split/) doi:10.32657/10356/55194

BibTeX

@inproceedings{xiang2012icml-split,
  title     = {{A Split-Merge Framework for Comparing Clusterings}},
  author    = {Xiang, Qiaoliang and Mao, Qi and Chai, Kian Ming Adam and Chieu, Hai Leong and Tsang, Ivor W. and Zhao, Zhendong},
  booktitle = {International Conference on Machine Learning},
  year      = {2012},
  doi       = {10.32657/10356/55194},
  url       = {https://mlanthology.org/icml/2012/xiang2012icml-split/}
}