Mixed Bregman Clustering with Approximation Guarantees

Abstract

Two recent breakthroughs have dramatically improved the scope and performance of k-means clustering: squared Euclidean seeding for the initialization step, and Bregman clustering for the iterative step. In this paper, we first unite the two frameworks by generalizing the former improvement to Bregman seeding, a biased randomized seeding technique using Bregman divergences, while generalizing its important theoretical approximation guarantees as well. We end up with a complete Bregman hard clustering algorithm integrating the distortion at hand in both the initialization and iterative steps. Our second contribution is to further generalize this algorithm to handle mixed Bregman distortions, which smooth out the asymmetry of Bregman divergences. In contrast to some other symmetrization approaches, our approach keeps the algorithm simple and allows us to generalize theoretical guarantees from regular Bregman clustering. Preliminary experiments show that using the proposed seeding with a suitable Bregman divergence can help us discover the underlying structure of the data.
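
To make the idea of biased randomized seeding concrete, here is a minimal sketch in Python. It is not the authors' implementation: it assumes the usual pattern of squared Euclidean seeding (first center drawn uniformly, each later center drawn with probability proportional to its divergence to the closest center chosen so far) and simply swaps in a Bregman divergence. The function names `squared_euclidean`, `generalized_kl`, and `bregman_seeding`, and the argument order of the divergence, are illustrative assumptions.

```python
import numpy as np


def squared_euclidean(x, c):
    """Squared Euclidean distance, the Bregman divergence of ||x||^2."""
    return float(np.sum((x - c) ** 2))


def generalized_kl(x, c):
    """Generalized I-divergence; assumes strictly positive vectors."""
    return float(np.sum(x * np.log(x / c) - x + c))


def bregman_seeding(X, k, divergence, rng):
    """Pick k seeds from the rows of X (hypothetical helper, a sketch only).

    The first seed is uniform; each later seed is drawn with probability
    proportional to the divergence to its closest already-chosen seed.
    """
    n = X.shape[0]
    first = rng.integers(n)
    centers = [X[first]]
    # closest[i] = smallest divergence from point i to any chosen seed
    closest = np.array([divergence(x, X[first]) for x in X])
    for _ in range(1, k):
        total = closest.sum()
        # Fall back to uniform sampling if every point coincides with a seed
        probs = closest / total if total > 0 else np.full(n, 1.0 / n)
        idx = rng.choice(n, p=probs)
        centers.append(X[idx])
        new = np.array([divergence(x, X[idx]) for x in X])
        closest = np.minimum(closest, new)
    return np.array(centers)


rng = np.random.default_rng(0)
X = rng.uniform(0.1, 1.0, size=(50, 4))  # strictly positive toy data
seeds_euclid = bregman_seeding(X, 3, squared_euclidean, rng)
seeds_kl = bregman_seeding(X, 3, generalized_kl, rng)
```

Passing the divergence as a function keeps the seeding loop identical for every distortion, which mirrors the abstract's point that the same distortion can then be reused in the iterative step. A mixed distortion would be supplied the same way, as another callable.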

Cite

Text

Nock et al. "Mixed Bregman Clustering with Approximation Guarantees." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2008. doi:10.1007/978-3-540-87481-2_11

Markdown

[Nock et al. "Mixed Bregman Clustering with Approximation Guarantees." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2008.](https://mlanthology.org/ecmlpkdd/2008/nock2008ecmlpkdd-mixed/) doi:10.1007/978-3-540-87481-2_11

BibTeX

@inproceedings{nock2008ecmlpkdd-mixed,
  title     = {{Mixed Bregman Clustering with Approximation Guarantees}},
  author    = {Nock, Richard and Luosto, Panu and Kivinen, Jyrki},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2008},
  pages     = {154--169},
  doi       = {10.1007/978-3-540-87481-2_11},
  url       = {https://mlanthology.org/ecmlpkdd/2008/nock2008ecmlpkdd-mixed/}
}