K-Means Clustering with Distance-Based Privacy

Abstract

In this paper, we initiate the study of Euclidean clustering with Distance-based differential privacy. Distance-based privacy is motivated by the fact that it is often only needed to protect the privacy of exact, rather than approximate, locations. We provide constant-approximate algorithms for $k$-means and $k$-median clustering, with additive error depending only on the attacker's precision bound $\rho$, rather than the radius $\Lambda$ of the space. In addition, we empirically demonstrate that our algorithm performs significantly better than previous differentially private clustering algorithms, as well as naive distance-based private clustering baselines.

Cite

Text

Epasto et al. "K-Means Clustering with Distance-Based Privacy." ICML 2023 Workshops: TAGML, 2023.

Markdown

[Epasto et al. "K-Means Clustering with Distance-Based Privacy." ICML 2023 Workshops: TAGML, 2023.](https://mlanthology.org/icmlw/2023/epasto2023icmlw-kmeans/)

BibTeX

@inproceedings{epasto2023icmlw-kmeans,
  title     = {{K-Means Clustering with Distance-Based Privacy}},
  author    = {Epasto, Alessandro and Mirrokni, Vahab and Narayanan, Shyam and Zhong, Peilin},
  booktitle = {ICML 2023 Workshops: TAGML},
  year      = {2023},
  url       = {https://mlanthology.org/icmlw/2023/epasto2023icmlw-kmeans/}
}