SoK: Privacy-Preserving Clustering (Extended Abstract)

Abstract

Clustering is a popular unsupervised machine learning technique that groups similar input elements into clusters. In many applications, sensitive information is clustered that should not be leaked. Moreover, nowadays it is often required to combine data from multiple sources to increase the quality of the analysis as well as to outsource complex computation to powerful cloud servers. This calls for efficient privacy-preserving clustering. In this work, we systematically analyze the state-of-the-art in privacy-preserving clustering. We implement and benchmark today's four most efficient fully private clustering protocols by Cheon et al. (SAC'19), Meng et al. (ArXiv'19), Mohassel et al. (PETS'20), and Bozdemir et al. (ASIACCS'21) with respect to communication, computation, and clustering quality.

Cite

Text

Hegde et al. "SoK: Privacy-Preserving Clustering (Extended Abstract)." NeurIPS 2021 Workshops: PRIML, 2021.

Markdown

[Hegde et al. "SoK: Privacy-Preserving Clustering (Extended Abstract)." NeurIPS 2021 Workshops: PRIML, 2021.](https://mlanthology.org/neuripsw/2021/hegde2021neuripsw-sok/)

BibTeX

@inproceedings{hegde2021neuripsw-sok,
  title     = {{SoK: Privacy-Preserving Clustering (Extended Abstract)}},
  author    = {Hegde, Aditya and Möllering, Helen and Schneider, Thomas and Yalame, Hossein},
  booktitle = {NeurIPS 2021 Workshops: PRIML},
  year      = {2021},
  url       = {https://mlanthology.org/neuripsw/2021/hegde2021neuripsw-sok/}
}