AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models

Li, Kai; Shen, Can; Liu, Yile; Han, Jirui; Zheng, Kelong; Zou, Xuechao; Wang, Lionel Z.; Zhang, Shun; Du, Xingjian; Luo, Hanjun; Jin, Yingbin; Xing, Xinxin; Ma, Ziyang; Liu, Yue; Zhang, YiFan; Fang, Junfeng; Wang, Kun; Yan, Yibo; Deng, Gelei; Li, Haoyang; Li, Yiming; Zhuang, Xiaobin; Chen, Tianlong; Wen, Qingsong; Zhang, Tianwei; Liu, Yang; Hu, Haibo; Wu, Zhizheng; Hu, Xiaolin; Chng, Eng Siong; Xu, Wenyuan; Wang, XiaoFeng; Dong, Wei; Li, Xinfeng

AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models

ICLR 2026

/iclr/2026/li2026iclr-audiotrust/

Abstract

The rapid development and widespread adoption of Audio Large Language Models (ALLMs) require a rigorous assessment of their trustworthiness. However, existing evaluation frameworks, primarily designed for text, are not equipped to handle the unique vulnerabilities introduced by audio’s acoustic properties. We find that significant trustworthiness risks in ALLMs arise from non-semantic acoustic cues, such as timbre, accent, and background noise, which can be used to manipulate model behavior. To address this gap, we propose AudioTrust, the first framework for large-scale and systematic evaluation of ALLM trustworthiness concerning these audio-specific risks. AudioTrust spans six key dimensions: fairness, hallucination, safety, privacy, robustness, and authenticition. It is implemented through 26 distinct sub-tasks and a curated dataset of over 4,420 audio samples collected from real-world scenarios (e.g., daily conversations, emergency calls, and voice assistant interactions), purposefully constructed to probe the trustworthiness of ALLMs across multiple dimensions. Our comprehensive evaluation includes 18 distinct experimental configurations and employs human-validated automated pipelines to objectively and scalably quantify model outputs. Experimental results reveal the boundaries and limitations of 14 state-of-the-art (SOTA) open-source and closed-source ALLMs when confronted with diverse high-risk audio scenarios, thereby offering critical insights into the secure and trustworthy deployment of future audio models. Our platform and benchmark are publicly available at https://github.com/JusperLee/AudioTrust.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Li et al. "AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models." International Conference on Learning Representations, 2026.

Markdown

[Li et al. "AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/li2026iclr-audiotrust/)

BibTeX

@inproceedings{li2026iclr-audiotrust,
  title     = {{AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models}},
  author    = {Li, Kai and Shen, Can and Liu, Yile and Han, Jirui and Zheng, Kelong and Zou, Xuechao and Wang, Lionel Z. and Zhang, Shun and Du, Xingjian and Luo, Hanjun and Jin, Yingbin and Xing, Xinxin and Ma, Ziyang and Liu, Yue and Zhang, YiFan and Fang, Junfeng and Wang, Kun and Yan, Yibo and Deng, Gelei and Li, Haoyang and Li, Yiming and Zhuang, Xiaobin and Chen, Tianlong and Wen, Qingsong and Zhang, Tianwei and Liu, Yang and Hu, Haibo and Wu, Zhizheng and Hu, Xiaolin and Chng, Eng Siong and Xu, Wenyuan and Wang, XiaoFeng and Dong, Wei and Li, Xinfeng},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/li2026iclr-audiotrust/}
}