K-Medoids for K-Means Seeding

Abstract

We show experimentally that the algorithm CLARANS of Ng and Han (1994) finds better K-medoids solutions than the Voronoi iteration algorithm of Hastie et al. (2001). This finding, along with the similarity between the Voronoi iteration algorithm and Lloyd's K-means algorithm, motivates us to use CLARANS as a K-means initializer. We show that CLARANS outperforms other algorithms on 23/23 datasets with a mean decrease over k-means++ of 30% for initialization mean squared error (MSE) and 3% for final MSE. We introduce algorithmic improvements to CLARANS which improve its complexity and runtime, making it a viable initialization scheme for large datasets.

Cite

Text

Newling and Fleuret. "K-Medoids for K-Means Seeding." Neural Information Processing Systems, 2017.

Markdown

[Newling and Fleuret. "K-Medoids for K-Means Seeding." Neural Information Processing Systems, 2017.](https://mlanthology.org/neurips/2017/newling2017neurips-kmedoids/)

BibTeX

@inproceedings{newling2017neurips-kmedoids,
  title     = {{K-Medoids for K-Means Seeding}},
  author    = {Newling, James and Fleuret, François},
  booktitle = {Neural Information Processing Systems},
  year      = {2017},
  pages     = {5195-5203},
  url       = {https://mlanthology.org/neurips/2017/newling2017neurips-kmedoids/}
}