Probabilistic Consensus Clustering Using Evidence Accumulation
Abstract
Clustering ensemble methods produce a consensus partition of a set of data points by combining the results of a collection of base clustering algorithms. In the evidence accumulation clustering (EAC) paradigm, the clustering ensemble is transformed into a pairwise co-association matrix, thus avoiding the label correspondence problem, which is intrinsic to other clustering ensemble schemes. In this paper, we propose a consensus clustering approach based on the EAC paradigm, which is not limited to crisp partitions and fully exploits the nature of the co-association matrix. Our solution determines probabilistic assignments of data points to clusters by minimizing a Bregman divergence between the observed co-association frequencies and the corresponding co-occurrence probabilities expressed as functions of the unknown assignments. We additionally propose an optimization algorithm to find a solution under any double-convex Bregman divergence. Experiments on both synthetic and real benchmark data show the effectiveness of the proposed approach.
Cite
Text
Lourenço et al. "Probabilistic Consensus Clustering Using Evidence Accumulation." Machine Learning, 2015. doi:10.1007/S10994-013-5339-6Markdown
[Lourenço et al. "Probabilistic Consensus Clustering Using Evidence Accumulation." Machine Learning, 2015.](https://mlanthology.org/mlj/2015/lourenco2015mlj-probabilistic/) doi:10.1007/S10994-013-5339-6BibTeX
@article{lourenco2015mlj-probabilistic,
title = {{Probabilistic Consensus Clustering Using Evidence Accumulation}},
author = {Lourenço, André and Bulò, Samuel Rota and Rebagliati, Nicola and Fred, Ana L. N. and Figueiredo, Mário A. T. and Pelillo, Marcello},
journal = {Machine Learning},
year = {2015},
pages = {331-357},
doi = {10.1007/S10994-013-5339-6},
volume = {98},
url = {https://mlanthology.org/mlj/2015/lourenco2015mlj-probabilistic/}
}