Revisiting Probabilistic Models for Clustering with Pair-Wise Constraints
Abstract
We revisit recently proposed algorithms for probabilistic clustering with pair-wise constraints between data points. We evaluate and compare existing techniques in terms of robustness to misspecified constraints. We show that the technique that strictly enforces the given constraints, namely the chunklet model, produces poor results even under a small number of misspecified constraints. We further show that methods that penalize constraint violation are more robust to misspecified constraints but have undesirable local behaviors. Based on this evaluation, we propose a new learning technique, extending the chunklet model to allow soft constraints represented by an intuitive measure of confidence in the constraint.
Cite
Text
Nelson and Cohen. "Revisiting Probabilistic Models for Clustering with Pair-Wise Constraints." International Conference on Machine Learning, 2007. doi:10.1145/1273496.1273581Markdown
[Nelson and Cohen. "Revisiting Probabilistic Models for Clustering with Pair-Wise Constraints." International Conference on Machine Learning, 2007.](https://mlanthology.org/icml/2007/nelson2007icml-revisiting/) doi:10.1145/1273496.1273581BibTeX
@inproceedings{nelson2007icml-revisiting,
title = {{Revisiting Probabilistic Models for Clustering with Pair-Wise Constraints}},
author = {Nelson, Blaine and Cohen, Ira},
booktitle = {International Conference on Machine Learning},
year = {2007},
pages = {673-680},
doi = {10.1145/1273496.1273581},
url = {https://mlanthology.org/icml/2007/nelson2007icml-revisiting/}
}