Unified K-Means Clustering with Label-Guided Manifold Learning
Abstract
K-Means clustering is a classical and effective unsupervised learning method attributed to its simplicity and efficiency. However, it faces notable challenges, including sensitivity to random initial centroid selection, a limited ability to discover the intrinsic manifold structures within nonlinear datasets, and difficulty in achieving balanced clustering in practical scenarios. To overcome these weaknesses, we introduce a novel framework for K-Means that leverages manifold learning. This approach eliminates the need for centroid calculation and utilizes a cluster indicator matrix to align the manifold structures, thereby enhancing clustering accuracy. Beyond the traditional Euclidean distance, our model incorporates Gaussian kernel distance, K-nearest neighbor distance, and low-pass filtering distance to effectively manage data that is not linearly separable. Furthermore, we introduce a balanced regularizer to achieve balanced clustering results. The detailed experimental results demonstrate the efficacy of our proposed methodology.
Cite
Text
Wang et al. "Unified K-Means Clustering with Label-Guided Manifold Learning." Proceedings of the 42nd International Conference on Machine Learning, 2025.Markdown
[Wang et al. "Unified K-Means Clustering with Label-Guided Manifold Learning." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/wang2025icml-unified/)BibTeX
@inproceedings{wang2025icml-unified,
title = {{Unified K-Means Clustering with Label-Guided Manifold Learning}},
author = {Wang, Qianqian and Jiang, Mengping and Ding, Zhengming and Gao, Quanxue},
booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
year = {2025},
pages = {63186-63199},
volume = {267},
url = {https://mlanthology.org/icml/2025/wang2025icml-unified/}
}