Attacks on Third-Party APIs of Large Language Models

Abstract

Large language model (LLM) services have recently begun offering plugin ecosystems that interact with third-party API services. This innovation enhances the capabilities of LLMs, but it also introduces risks, since these plugins are developed by various third parties and cannot easily be trusted. This paper proposes a new attack framework for examining security and safety vulnerabilities within LLM platforms that incorporate third-party services. Applying our framework to widely used LLMs, we identify real-world malicious attacks on third-party APIs across various domains that can imperceptibly alter LLM outputs. The paper discusses the unique challenges posed by third-party API integration and offers strategic possibilities for improving the security and safety of LLM ecosystems moving forward.

Cite

Text

Zhao et al. "Attacks on Third-Party APIs of Large Language Models." ICLR 2024 Workshops: SeT_LLM, 2024.

Markdown

[Zhao et al. "Attacks on Third-Party APIs of Large Language Models." ICLR 2024 Workshops: SeT_LLM, 2024.](https://mlanthology.org/iclrw/2024/zhao2024iclrw-attacks/)

BibTeX

@inproceedings{zhao2024iclrw-attacks,
  title     = {{Attacks on Third-Party APIs of Large Language Models}},
  author    = {Zhao, Wanru and Khazanchi, Vidit and Xing, Haodi and He, Xuanli and Xu, Qiongkai and Lane, Nicholas Donald},
  booktitle = {ICLR 2024 Workshops: SeT_LLM},
  year      = {2024},
  url       = {https://mlanthology.org/iclrw/2024/zhao2024iclrw-attacks/}
}