Privacy for Free: Posterior Sampling and Stochastic Gradient Monte Carlo
Abstract
We consider the problem of Bayesian learning on sensitive datasets and present two simple but somewhat surprising results that connect Bayesian learning to “differential privacy”, a cryptographic approach to protecting individual-level privacy while permitting database-level utility. Specifically, we show that under standard assumptions, getting one sample from a posterior distribution is differentially private “for free”; and this sample as a statistical estimator is often consistent, near optimal, and computationally tractable. Similarly but separately, we show that a recent line of work that uses stochastic gradients for Hybrid Monte Carlo (HMC) sampling also preserves differential privacy with minor or no modification of the algorithmic procedure. These observations lead to an “anytime” algorithm for Bayesian learning under privacy constraints. We demonstrate that it performs much better than state-of-the-art differentially private methods on synthetic and real datasets.
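To make the first result concrete, the following is a minimal Python sketch of the one-posterior-sample idea for a Beta–Bernoulli model; it is an illustration under the bounded-log-likelihood assumption the abstract alludes to, not the authors' implementation. The function name `ops_beta_bernoulli`, the truncation parameter `a`, and the toy data are ours. Confining θ to [a, 1−a] bounds the per-record log-likelihood by B = log(1/a), and by the exponential-mechanism argument an exact sample from the truncated posterior is then 4B-differentially private.

```python
import numpy as np

def ops_beta_bernoulli(data, alpha=1.0, beta=1.0, a=0.1, rng=None):
    """Release one sample from the posterior of a Bernoulli parameter
    theta under a Beta(alpha, beta) prior, truncated to [a, 1 - a].

    With theta confined to [a, 1 - a], |log p(x | theta)| <= log(1/a) =: B
    for every record x, so releasing this single exact posterior sample
    is 4B-differentially private by the exponential-mechanism argument.
    """
    rng = np.random.default_rng() if rng is None else rng
    k, n = int(np.sum(data)), len(data)
    # Rejection sampling: draw from the untruncated conjugate posterior
    # Beta(alpha + k, beta + n - k) until the draw lands in [a, 1 - a].
    while True:
        theta = rng.beta(alpha + k, beta + n - k)
        if a <= theta <= 1.0 - a:
            return theta

# Toy usage: 100 coin flips with bias 0.3; a = 0.1 gives B ~ 2.30,
# so the released sample is roughly 9.2-differentially private.
rng = np.random.default_rng(0)
data = rng.binomial(1, 0.3, size=100)
print(ops_beta_bernoulli(data, a=0.1, rng=rng))
```

Note the privacy cost is paid entirely by the modeling assumption (the likelihood bound B); no noise is added on top of ordinary posterior sampling, which is the sense in which privacy comes “for free”.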
Cite
Text
Wang et al. "Privacy for Free: Posterior Sampling and Stochastic Gradient Monte Carlo." International Conference on Machine Learning, 2015.
Markdown
[Wang et al. "Privacy for Free: Posterior Sampling and Stochastic Gradient Monte Carlo." International Conference on Machine Learning, 2015.](https://mlanthology.org/icml/2015/wang2015icml-privacy/)
BibTeX
@inproceedings{wang2015icml-privacy,
  title = {{Privacy for Free: Posterior Sampling and Stochastic Gradient Monte Carlo}},
  author = {Wang, Yu-Xiang and Fienberg, Stephen and Smola, Alex},
  booktitle = {International Conference on Machine Learning},
  year = {2015},
  pages = {2493--2502},
  volume = {37},
  url = {https://mlanthology.org/icml/2015/wang2015icml-privacy/}
}