A Kernel-Learning Approach to Semi-Supervised Clustering with Relative Distance Comparisons
Abstract
We consider the problem of clustering a given dataset into k clusters subject to an additional set of constraints on relative distance comparisons between the data items. The additional constraints are meant to reflect side-information that is not expressed in the feature vectors, directly. Relative comparisons can express structures at finer level of detail than must-link (ML) and cannot-link (CL) constraints that are commonly used for semi-supervised clustering. Relative comparisons are particularly useful in settings where giving an ML or a CL constraint is difficult because the granularity of the true clustering is unknown. Our main contribution is an efficient algorithm for learning a kernel matrix using the log determinant divergence (a variant of the Bregman divergence) subject to a set of relative distance constraints. Given the learned kernel matrix, a clustering can be obtained by any suitable algorithm, such as kernel k -means. We show empirically that kernels found by our algorithm yield clusterings of higher quality than existing approaches that either use ML/CL constraints or a different means to implement the supervision using relative comparisons.
Cite
Text
Amid et al. "A Kernel-Learning Approach to Semi-Supervised Clustering with Relative Distance Comparisons." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2015. doi:10.1007/978-3-319-23528-8_14Markdown
[Amid et al. "A Kernel-Learning Approach to Semi-Supervised Clustering with Relative Distance Comparisons." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2015.](https://mlanthology.org/ecmlpkdd/2015/amid2015ecmlpkdd-kernellearning/) doi:10.1007/978-3-319-23528-8_14BibTeX
@inproceedings{amid2015ecmlpkdd-kernellearning,
title = {{A Kernel-Learning Approach to Semi-Supervised Clustering with Relative Distance Comparisons}},
author = {Amid, Ehsan and Gionis, Aristides and Ukkonen, Antti},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2015},
pages = {219-234},
doi = {10.1007/978-3-319-23528-8_14},
url = {https://mlanthology.org/ecmlpkdd/2015/amid2015ecmlpkdd-kernellearning/}
}