Improving the Variance of Differentially Private Randomized Experiments Through Clustering
Abstract
Estimating causal effects from randomized experiments is only possible if participants are willing to disclose their potentially sensitive responses. Differential privacy, a widely used framework for ensuring an algorithm’s privacy guarantees, can encourage participants to share their responses without the risk of de-anonymization. However, many mechanisms achieve differential privacy by adding noise to the original dataset, which reduces the precision of causal effect estimation. This introduces a fundamental trade-off between privacy and variance when performing causal analyses on differentially private data. In this work, we propose a new differentially private mechanism, Cluster-DP, which leverages a given cluster structure in the data to improve the privacy-variance trade-off. While our results apply to any clustering, we demonstrate that selecting higher-quality clusters—according to a quality metric we introduce—can decrease the variance penalty without compromising privacy guarantees. Finally, we evaluate the theoretical and empirical performance of our Cluster-DP algorithm on both real and simulated data, comparing it to common baselines, including two special cases of our algorithm: its unclustered version and a uniform-prior version.
Cite
Text
Javanmard et al. "Improving the Variance of Differentially Private Randomized Experiments Through Clustering." Proceedings of the 42nd International Conference on Machine Learning, 2025.Markdown
[Javanmard et al. "Improving the Variance of Differentially Private Randomized Experiments Through Clustering." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/javanmard2025icml-improving/)BibTeX
@inproceedings{javanmard2025icml-improving,
title = {{Improving the Variance of Differentially Private Randomized Experiments Through Clustering}},
author = {Javanmard, Adel and Mirrokni, Vahab and Pouget-Abadie, Jean},
booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
year = {2025},
pages = {26946-26980},
volume = {267},
url = {https://mlanthology.org/icml/2025/javanmard2025icml-improving/}
}