PrivAuditor: Benchmarking Data Protection Vulnerabilities in LLM Adaptation Techniques

Zhu, Derui; Chen, Dingfan; Wu, Xiongfei; Geng, Jiahui; Li, Zhuo; Grossklags, Jens; Ma, Lei

doi:10.52202/079017-0308

PrivAuditor: Benchmarking Data Protection Vulnerabilities in LLM Adaptation Techniques

Derui Zhu, Dingfan Chen, Xiongfei Wu, Jiahui Geng, Zhuo Li, Jens Grossklags, Lei Ma

NeurIPS 2024

doi:10.52202/079017-0308 /neurips/2024/zhu2024neurips-privauditor/

Abstract

Large Language Models (LLMs) are recognized for their potential to be an important building block toward achieving artificial general intelligence due to their unprecedented capability for solving diverse tasks. Despite these achievements, LLMs often underperform in domain-specific tasks without training on relevant domain data. This phenomenon, which is often attributed to distribution shifts, makes adapting pre-trained LLMs with domain-specific data crucial. However, this adaptation raises significant privacy concerns, especially when the data involved come from sensitive domains. In this work, we extensively investigate the privacy vulnerabilities of adapted (fine-tuned) LLMs and benchmark privacy leakage across a wide range of data modalities, state-of-the-art privacy attack methods, adaptation techniques, and model architectures. We systematically evaluate and pinpoint critical factors related to privacy leakage. With our organized codebase and actionable insights, we aim to provide a standardized auditing tool for practitioners seeking to deploy customized LLM applications with faithful privacy assessments.

PDF NeurIPS OpenReview Semantic Scholar

Cite

Text

Zhu et al. "PrivAuditor: Benchmarking Data Protection Vulnerabilities in LLM Adaptation Techniques." Neural Information Processing Systems, 2024. doi:10.52202/079017-0308

Markdown

[Zhu et al. "PrivAuditor: Benchmarking Data Protection Vulnerabilities in LLM Adaptation Techniques." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/zhu2024neurips-privauditor/) doi:10.52202/079017-0308

BibTeX

@inproceedings{zhu2024neurips-privauditor,
  title     = {{PrivAuditor: Benchmarking Data Protection Vulnerabilities in LLM Adaptation Techniques}},
  author    = {Zhu, Derui and Chen, Dingfan and Wu, Xiongfei and Geng, Jiahui and Li, Zhuo and Grossklags, Jens and Ma, Lei},
  booktitle = {Neural Information Processing Systems},
  year      = {2024},
  doi       = {10.52202/079017-0308},
  url       = {https://mlanthology.org/neurips/2024/zhu2024neurips-privauditor/}
}