A Kernel-Learning Approach to Semi-Supervised Clustering with Relative Distance Comparisons

Abstract

We consider the problem of clustering a given dataset into  k clusters subject to an additional set of constraints on relative distance comparisons between the data items. The additional constraints are meant to reflect side-information that is not expressed in the feature vectors, directly. Relative comparisons can express structures at finer level of detail than must-link (ML) and cannot-link (CL) constraints that are commonly used for semi-supervised clustering. Relative comparisons are particularly useful in settings where giving an ML or a CL constraint is difficult because the granularity of the true clustering is unknown. Our main contribution is an efficient algorithm for learning a kernel matrix using the log determinant divergence (a variant of the Bregman divergence) subject to a set of relative distance constraints. Given the learned kernel matrix, a clustering can be obtained by any suitable algorithm, such as kernel k -means. We show empirically that kernels found by our algorithm yield clusterings of higher quality than existing approaches that either use ML/CL constraints or a different means to implement the supervision using relative comparisons.

Cite

Text

Amid et al. "A Kernel-Learning Approach to Semi-Supervised Clustering with Relative Distance Comparisons." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2015. doi:10.1007/978-3-319-23528-8_14

Markdown

[Amid et al. "A Kernel-Learning Approach to Semi-Supervised Clustering with Relative Distance Comparisons." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2015.](https://mlanthology.org/ecmlpkdd/2015/amid2015ecmlpkdd-kernellearning/) doi:10.1007/978-3-319-23528-8_14

BibTeX

@inproceedings{amid2015ecmlpkdd-kernellearning,
  title     = {{A Kernel-Learning Approach to Semi-Supervised Clustering with Relative Distance Comparisons}},
  author    = {Amid, Ehsan and Gionis, Aristides and Ukkonen, Antti},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2015},
  pages     = {219-234},
  doi       = {10.1007/978-3-319-23528-8_14},
  url       = {https://mlanthology.org/ecmlpkdd/2015/amid2015ecmlpkdd-kernellearning/}
}