On Differentially Private U Statistics

Abstract

We consider the problem of privately estimating a parameter $\mathbb{E}[h(X_1,\dots,X_k)]$, where $X_1$, $X_2$, $\dots$, $X_k$ are i.i.d. data from some distribution and $h$ is a permutation-invariant function. Without privacy constraints, the standard estimators for this task are U-statistics, which commonly arise in a wide range of problems, including nonparametric signed rank tests, symmetry testing, uniformity testing, and subgraph counts in random networks, and are the unique minimum variance unbiased estimators under mild conditions. Despite the recent outpouring of interest in private mean estimation, privatizing U-statistics has received little attention. While existing private mean estimation algorithms can be applied in a black-box manner to obtain confidence intervals, we show that they can lead to suboptimal private error, e.g., constant-factor inflation in the leading term, or even $\Theta(1/n)$ rather than $O(1/n^2)$ in degenerate settings. To remedy this, we propose a new thresholding-based approach that reweights different subsets of the data using _local Hájek projections_. This leads to nearly optimal private error for non-degenerate U-statistics and a strong indication of near-optimality for degenerate U-statistics.

Cite

Text

Chaudhuri et al. "On Differentially Private U Statistics." Neural Information Processing Systems, 2024. doi:10.52202/079017-0727

Markdown

[Chaudhuri et al. "On Differentially Private U Statistics." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/chaudhuri2024neurips-differentially/) doi:10.52202/079017-0727

BibTeX

@inproceedings{chaudhuri2024neurips-differentially,
  title     = {{On Differentially Private U Statistics}},
  author    = {Chaudhuri, Kamalika and Loh, Po-Ling and Pandey, Shourya and Sarkar, Purnamrita},
  booktitle = {Neural Information Processing Systems},
  year      = {2024},
  doi       = {10.52202/079017-0727},
  url       = {https://mlanthology.org/neurips/2024/chaudhuri2024neurips-differentially/}
}