Private Retrieval Augmented Generation with Random Projection

Abstract

Large Language Models (LLMs) have gained widespread interest and driven advancements across various fields. Retrieval-Augmented Generation (RAG) enables LLMs to incorporate domain-specific knowledge without retraining. However, evidence shows that RAG poses significant privacy risks due to leakage of sensitive information stored in the retrieval database. In this work, we propose a private randomized mechanism to project both the queries and the datastore into a lower-dimensional space using Gaussian matrices, while preserving the similarities for effective retrieval. Empirical evaluation on different RAG architectures demonstrates that our solution achieves strong empirical privacy protection with negligible impact on generation performance and latency compared to prior methods.

Cite

Text

Yao and Li. "Private Retrieval Augmented Generation with Random Projection." ICLR 2025 Workshops: BuildingTrust, 2025.

Markdown

[Yao and Li. "Private Retrieval Augmented Generation with Random Projection." ICLR 2025 Workshops: BuildingTrust, 2025.](https://mlanthology.org/iclrw/2025/yao2025iclrw-private/)

BibTeX

@inproceedings{yao2025iclrw-private,
  title     = {{Private Retrieval Augmented Generation with Random Projection}},
  author    = {Yao, Dixi and Li, Tian},
  booktitle = {ICLR 2025 Workshops: BuildingTrust},
  year      = {2025},
  url       = {https://mlanthology.org/iclrw/2025/yao2025iclrw-private/}
}