Private Vector Mean Estimation in the Shuffle Model: Optimal Rates Require Many Messages

Abstract

We study the problem of private vector mean estimation in the shuffle model of privacy where $n$ users each have a unit vector $v^{(i)} \in \mathbb{R}^d$. We propose a new multi-message protocol that achieves the optimal error using $O(\min(n\varepsilon^2,d))$ messages per user. Moreover, we show that any (unbiased) protocol that achieves optimal error must require each user to send $\Omega(\min(n\varepsilon^2,d)/\log(n))$ messages, demonstrating the optimality of our message complexity up to logarithmic factors. Additionally, we study the single-message setting and design a protocol that achieves mean squared error $O(dn^{d/(d+2)}\varepsilon^{-4/(d+2)})$. Moreover, we show that any single-message protocol must incur mean squared error $\Omega(dn^{d/(d+2)})$, showing that our protocol is optimal in the standard setting where $\varepsilon = \Theta(1)$. Finally, we study robustness to malicious users and show that malicious users can incur large additive error with a single shuffler.

Cite

Text

Asi et al. "Private Vector Mean Estimation in the Shuffle Model: Optimal Rates Require Many Messages." International Conference on Machine Learning, 2024.

Markdown

[Asi et al. "Private Vector Mean Estimation in the Shuffle Model: Optimal Rates Require Many Messages." International Conference on Machine Learning, 2024.](https://mlanthology.org/icml/2024/asi2024icml-private/)

BibTeX

@inproceedings{asi2024icml-private,
  title     = {{Private Vector Mean Estimation in the Shuffle Model: Optimal Rates Require Many Messages}},
  author    = {Asi, Hilal and Feldman, Vitaly and Nelson, Jelani and Nguyen, Huy and Talwar, Kunal and Zhou, Samson},
  booktitle = {International Conference on Machine Learning},
  year      = {2024},
  pages     = {1945-1970},
  volume    = {235},
  url       = {https://mlanthology.org/icml/2024/asi2024icml-private/}
}