Privacy-Preserving Energy-Based Generative Models for Marginal Distribution Protection

Abstract

We consider learning generative models for sensitive financial and healthcare data. While previous work incorporates Differential Privacy (DP) into GAN training to protect the privacy of individual training instances, we consider a different privacy context where the primary objective is protecting the privacy of sensitive marginal distributions of the true generative process. We propose and motivate a new notion of privacy: \emph{$\alpha$-Level Marginal Distribution Privacy} ($\alpha$-LMDP), which provides a statistical guarantee that the sensitive generative marginal distributions are different from the observed real data. We then propose \emph{Privacy-Preserving Energy Models (PPEMs)}, a novel energy-based generative model formulation where the representations for these attributes are isolated from other attributes. This structured formulation motivates a learning procedure where a penalty based on a statistical goodness of fit test, the \emph{Kernel Stein Discrepancy}, can be applied to only the attributes requiring privacy so that $\alpha$-LMDP may be satisfied without affecting the other attributes. We evaluate this approach using financial and healthcare datasets and demonstrate that the resulting learnt generative models produce high fidelity synthetic data while preserving privacy. We also show that PPEMs can incorporate both $\alpha$-LMDP \emph{and} DP in contexts where both forms of privacy are required.

Cite

Text

Tillman et al. "Privacy-Preserving Energy-Based Generative Models for Marginal Distribution Protection." Transactions on Machine Learning Research, 2023.

Markdown

[Tillman et al. "Privacy-Preserving Energy-Based Generative Models for Marginal Distribution Protection." Transactions on Machine Learning Research, 2023.](https://mlanthology.org/tmlr/2023/tillman2023tmlr-privacypreserving/)

BibTeX

@article{tillman2023tmlr-privacypreserving,
  title     = {{Privacy-Preserving Energy-Based Generative Models for Marginal Distribution Protection}},
  author    = {Tillman, Robert E. and Balch, Tucker and Veloso, Manuela},
  journal   = {Transactions on Machine Learning Research},
  year      = {2023},
  url       = {https://mlanthology.org/tmlr/2023/tillman2023tmlr-privacypreserving/}
}