Spectral Relaxation for K-Means Clustering
Abstract
The popular K-means clustering partitions a data set by minimiz(cid:173) ing a sum-of-squares cost function. A coordinate descend method is then used to find local minima. In this paper we show that the minimization can be reformulated as a trace maximization problem associated with the Gram matrix of the data vectors. Furthermore, we show that a relaxed version of the trace maximization problem possesses global optimal solutions which can be obtained by com(cid:173) puting a partial eigendecomposition of the Gram matrix, and the cluster assignment for each data vectors can be found by comput(cid:173) ing a pivoted QR decomposition of the eigenvector matrix. As a by-product we also derive a lower bound for the minimum of the sum-of-squares cost function.
Cite
Text
Zha et al. "Spectral Relaxation for K-Means Clustering." Neural Information Processing Systems, 2001.Markdown
[Zha et al. "Spectral Relaxation for K-Means Clustering." Neural Information Processing Systems, 2001.](https://mlanthology.org/neurips/2001/zha2001neurips-spectral/)BibTeX
@inproceedings{zha2001neurips-spectral,
title = {{Spectral Relaxation for K-Means Clustering}},
author = {Zha, Hongyuan and He, Xiaofeng and Ding, Chris and Gu, Ming and Simon, Horst D.},
booktitle = {Neural Information Processing Systems},
year = {2001},
pages = {1057-1064},
url = {https://mlanthology.org/neurips/2001/zha2001neurips-spectral/}
}