Tight Data Access Bounds for Private Top-$k$ Selection
Abstract
We study the top-$k$ selection problem under the differential privacy model: $m$ items are rated according to votes of a set of clients. We consider a setting in which algorithms can retrieve data via a sequence of accesses, each either a random access or a sorted access; the goal is to minimize the total number of data accesses. Our algorithm requires only $O(\sqrt{mk})$ expected accesses: to our knowledge, this is the first sublinear data-access upper bound for this problem. Our analysis also shows that the well-known exponential mechanism requires only $O(\sqrt{m})$ expected accesses. Accompanying this, we develop the first lower bounds for the problem, in three settings: only random accesses; only sorted accesses; a sequence of accesses of either kind. We show that, to avoid $\Omega(m)$ access cost, supporting both kinds of access is necessary, and that in this case our algorithm’s access cost is optimal.
Cite
Text
Wu et al. "Tight Data Access Bounds for Private Top-$k$ Selection." International Conference on Machine Learning, 2023.Markdown
[Wu et al. "Tight Data Access Bounds for Private Top-$k$ Selection." International Conference on Machine Learning, 2023.](https://mlanthology.org/icml/2023/wu2023icml-tight/)BibTeX
@inproceedings{wu2023icml-tight,
title = {{Tight Data Access Bounds for Private Top-$k$ Selection}},
author = {Wu, Hao and Ohrimenko, Olga and Wirth, Anthony},
booktitle = {International Conference on Machine Learning},
year = {2023},
pages = {37635-37655},
volume = {202},
url = {https://mlanthology.org/icml/2023/wu2023icml-tight/}
}