Spectral Relaxation for K-Means Clustering

Abstract

The popular K-means clustering partitions a data set by minimiz(cid:173) ing a sum-of-squares cost function. A coordinate descend method is then used to find local minima. In this paper we show that the minimization can be reformulated as a trace maximization problem associated with the Gram matrix of the data vectors. Furthermore, we show that a relaxed version of the trace maximization problem possesses global optimal solutions which can be obtained by com(cid:173) puting a partial eigendecomposition of the Gram matrix, and the cluster assignment for each data vectors can be found by comput(cid:173) ing a pivoted QR decomposition of the eigenvector matrix. As a by-product we also derive a lower bound for the minimum of the sum-of-squares cost function.

Cite

Text

Zha et al. "Spectral Relaxation for K-Means Clustering." Neural Information Processing Systems, 2001.

Markdown

[Zha et al. "Spectral Relaxation for K-Means Clustering." Neural Information Processing Systems, 2001.](https://mlanthology.org/neurips/2001/zha2001neurips-spectral/)

BibTeX

@inproceedings{zha2001neurips-spectral,
  title     = {{Spectral Relaxation for K-Means Clustering}},
  author    = {Zha, Hongyuan and He, Xiaofeng and Ding, Chris and Gu, Ming and Simon, Horst D.},
  booktitle = {Neural Information Processing Systems},
  year      = {2001},
  pages     = {1057-1064},
  url       = {https://mlanthology.org/neurips/2001/zha2001neurips-spectral/}
}