Differentially Private Top-K Selection via Stability on Unknown Domain

Abstract

We propose a new method that satisfies approximate differential privacy for top-$k$ selection with unordered output in the unknown data domain setting, not relying on the full knowledge of the domain universe. Our algorithm only requires looking at the top-$\bar{k}$ elements for any given $\bar{k} \geq k$, thus, enforcing the principle of minimal privilege. Unlike previous methods, our privacy parameter $\varepsilon$ does not scale with $k$, giving improved applicability for scenarios of very large $k$. Moreover, our novel construction, which combines the sparse vector technique and stability efficiently, can be applied as a general framework to any type of query, thus being of independent interest. We extensively compare our algorithm to previous work of top-$k$ selection on the unknown domain, and show, both analytically and on experiments, settings where we outperform the current state-of-the-art.

Cite

Text

Silva Carvalho et al. "Differentially Private Top-K Selection via Stability on Unknown Domain." Uncertainty in Artificial Intelligence, 2020.

Markdown

[Silva Carvalho et al. "Differentially Private Top-K Selection via Stability on Unknown Domain." Uncertainty in Artificial Intelligence, 2020.](https://mlanthology.org/uai/2020/silvacarvalho2020uai-differentially/)

BibTeX

@inproceedings{silvacarvalho2020uai-differentially,
  title     = {{Differentially Private Top-K Selection via Stability on Unknown Domain}},
  author    = {Silva Carvalho, Ricardo and Wang, Ke and Gondara, Lovedeep and Miao, Chunyan},
  booktitle = {Uncertainty in Artificial Intelligence},
  year      = {2020},
  pages     = {1109-1118},
  volume    = {124},
  url       = {https://mlanthology.org/uai/2020/silvacarvalho2020uai-differentially/}
}