$k$-Means Clustering with Distance-Based Privacy

Abstract

In this paper, we initiate the study of Euclidean clustering with Distance-based privacy. Distance-based privacy is motivated by the fact that it is often only needed to protect the privacy of exact, rather than approximate, locations. We provide constant-approximate algorithms for $k$-means and $k$-median clustering, with additive error depending only on the attacker's precision bound $\rho$, rather than the radius $\Lambda$ of the space. In addition, we empirically demonstrate that our algorithm performs significantly better than previous differentially private clustering algorithms, as well as naive distance-based private clustering baselines.

Cite

Text

Epasto et al. "$k$-Means Clustering with Distance-Based Privacy." Neural Information Processing Systems, 2023.

Markdown

[Epasto et al. "$k$-Means Clustering with Distance-Based Privacy." Neural Information Processing Systems, 2023.](https://mlanthology.org/neurips/2023/epasto2023neurips-kmeans/)

BibTeX

@inproceedings{epasto2023neurips-kmeans,
  title     = {{$k$-Means Clustering with Distance-Based Privacy}},
  author    = {Epasto, Alessandro and Mirrokni, Vahab and Narayanan, Shyam and Zhong, Peilin},
  booktitle = {Neural Information Processing Systems},
  year      = {2023},
  url       = {https://mlanthology.org/neurips/2023/epasto2023neurips-kmeans/}
}