K-Means Clustering with Distance-Based Privacy
Abstract
In this paper, we initiate the study of Euclidean clustering with Distance-based differential privacy. Distance-based privacy is motivated by the fact that it is often only needed to protect the privacy of exact, rather than approximate, locations. We provide constant-approximate algorithms for $k$-means and $k$-median clustering, with additive error depending only on the attacker's precision bound $\rho$, rather than the radius $\Lambda$ of the space. In addition, we empirically demonstrate that our algorithm performs significantly better than previous differentially private clustering algorithms, as well as naive distance-based private clustering baselines.
Cite
Text
Epasto et al. "K-Means Clustering with Distance-Based Privacy." ICML 2023 Workshops: TAGML, 2023.Markdown
[Epasto et al. "K-Means Clustering with Distance-Based Privacy." ICML 2023 Workshops: TAGML, 2023.](https://mlanthology.org/icmlw/2023/epasto2023icmlw-kmeans/)BibTeX
@inproceedings{epasto2023icmlw-kmeans,
title = {{K-Means Clustering with Distance-Based Privacy}},
author = {Epasto, Alessandro and Mirrokni, Vahab and Narayanan, Shyam and Zhong, Peilin},
booktitle = {ICML 2023 Workshops: TAGML},
year = {2023},
url = {https://mlanthology.org/icmlw/2023/epasto2023icmlw-kmeans/}
}