Marginal Release Under Multi-Party Personalized Differential Privacy
Abstract
Given a set of local datasets held by multiple parties, we study the problem of learning marginals over the integrated dataset while satisfying differential privacy for each local dataset. Different from existing works in the multi-party setting, our work allows the parties to have different privacy preferences for their data, which is referred to as the multi-party personalized differential privacy (PDP) problem. The existing solutions to PDP problems in the centralized setting mostly adopt sampling-based approaches. However, extending similar ideas to the multi-party setting cannot satisfactorily solve our problem. On the one hand, the data owned by multiple parties are usually not identically distributed. Sampling-based approaches will incur a serious distortion in the results. On the other hand, when the parties hold different attributes of the same set of individuals, sampling at the tuple level cannot meet parties’ personalized privacy requirements for different attributes. To address the above problems, we first present a mixture-of-multinomials-based marginal calculation approach, where the global marginals over the stretched datasets are formalized as a multinomial mixture model. As such, the global marginals over the original datasets can be reconstructed based on the calculated model parameters with high accuracy. We then propose a privacy budget segmentation method, which introduces a privacy division composition strategy from the view of attributes to make full use of each party’s privacy budget while meeting personalized privacy requirements for different attributes. Extensive experiments on real datasets demonstrate that our solution offers desirable data utility.
Cite
Text
Tang et al. "Marginal Release Under Multi-Party Personalized Differential Privacy." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2022. doi:10.1007/978-3-031-26412-2_34Markdown
[Tang et al. "Marginal Release Under Multi-Party Personalized Differential Privacy." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2022.](https://mlanthology.org/ecmlpkdd/2022/tang2022ecmlpkdd-marginal/) doi:10.1007/978-3-031-26412-2_34BibTeX
@inproceedings{tang2022ecmlpkdd-marginal,
title = {{Marginal Release Under Multi-Party Personalized Differential Privacy}},
author = {Tang, Peng and Chen, Rui and Jin, Chongshi and Liu, Gaoyuan and Guo, Shanqing},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2022},
pages = {555-571},
doi = {10.1007/978-3-031-26412-2_34},
url = {https://mlanthology.org/ecmlpkdd/2022/tang2022ecmlpkdd-marginal/}
}