Learning to Generate Image Embeddings with User-Level Differential Privacy

Abstract

Small on-device models have been successfully trained with user-level differential privacy (DP) for next-word prediction and image classification tasks. However, existing methods can fail when directly applied to learn embedding models using supervised training data with a large class space. To achieve user-level DP for large image-to-embedding feature extractors, we propose DP-FedEmb, a variant of federated learning algorithms with per-user sensitivity control and noise addition, to train from user-partitioned data centralized in a datacenter. DP-FedEmb combines virtual clients, partial aggregation, private local fine-tuning, and public pretraining to achieve strong privacy-utility trade-offs. We apply DP-FedEmb to train image embedding models for faces, landmarks, and natural species, and demonstrate its superior utility under the same privacy budget on the benchmark datasets DigiFace, GLD, and iNaturalist. We further show that it is possible to achieve strong user-level DP guarantees of epsilon < 2 while keeping the utility drop within 5% when millions of users can participate in training.
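
The "per-user sensitivity control and noise addition" mentioned above can be illustrated with a generic sketch of clipped, noised averaging of per-user model updates. This is not the DP-FedEmb algorithm itself: it omits virtual clients, partial aggregation, private local fine-tuning, and public pretraining, and it does no privacy accounting. The function names `clip_update` and `dp_aggregate` and the parameters `clip_norm` and `noise_multiplier` are hypothetical, chosen only for illustration.

```python
import math
import random


def clip_update(update, clip_norm):
    """Scale one user's update so its L2 norm is at most clip_norm."""
    norm = math.sqrt(sum(x * x for x in update))
    scale = min(1.0, clip_norm / norm) if norm > 0 else 1.0
    return [x * scale for x in update]


def dp_aggregate(user_updates, clip_norm, noise_multiplier, rng):
    """Average per-user updates after clipping and adding Gaussian noise.

    Assumption: each entry of user_updates is the single, combined update
    of one user, so clipping bounds that user's influence on the sum.
    """
    dim = len(user_updates[0])
    total = [0.0] * dim
    for update in user_updates:
        for i, x in enumerate(clip_update(update, clip_norm)):
            total[i] += x
    # Noise is calibrated to the per-user sensitivity (clip_norm) of the sum.
    stddev = noise_multiplier * clip_norm
    noisy = [t + rng.gauss(0.0, stddev) for t in total]
    return [v / len(user_updates) for v in noisy]


if __name__ == "__main__":
    rng = random.Random(0)
    updates = [[3.0, 4.0], [0.3, 0.4], [-1.0, 2.0]]
    print(dp_aggregate(updates, clip_norm=1.0, noise_multiplier=0.5, rng=rng))
```

Clipping each user's combined update, rather than each example's gradient, is what makes the guarantee user-level in this kind of scheme; the noise scale relative to the clipping norm, together with the number of participating users and training rounds, would determine the resulting epsilon.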

Cite

Text

Xu et al. "Learning to Generate Image Embeddings with User-Level Differential Privacy." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.00770

Markdown

[Xu et al. "Learning to Generate Image Embeddings with User-Level Differential Privacy." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/xu2023cvpr-learning/) doi:10.1109/CVPR52729.2023.00770

BibTeX

@inproceedings{xu2023cvpr-learning,
  title     = {{Learning to Generate Image Embeddings with User-Level Differential Privacy}},
  author    = {Xu, Zheng and Collins, Maxwell and Wang, Yuxiao and Panait, Liviu and Oh, Sewoong and Augenstein, Sean and Liu, Ting and Schroff, Florian and McMahan, H. Brendan},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2023},
  pages     = {7969-7980},
  doi       = {10.1109/CVPR52729.2023.00770},
  url       = {https://mlanthology.org/cvpr/2023/xu2023cvpr-learning/}
}