TrustGen: A Platform of Dynamic Benchmarking on the Trustworthiness of Generative Foundation Models

Abstract

Generative foundation models (GenFMs), such as large language models and text-to-image systems, have demonstrated remarkable capabilities in various downstream applications. As they are increasingly deployed in high-stakes applications, assessing their trustworthiness has become both a critical necessity and a substantial challenge. Existing evaluation efforts are fragmented, rapidly outdated, and often lack extensibility across modalities. This raises a fundamental question: how can we systematically, reliably, and continuously assess the trustworthiness of rapidly advancing GenFMs across diverse modalities and use cases? To address these gaps, we introduce TrustGen, a dynamic and modular benchmarking system designed to systematically evaluate the trustworthiness of GenFMs across text-to-image, large language, and vision-language modalities. TrustGen standardizes trust evaluation through a unified taxonomy of over 25 fine-grained dimensions—including truthfulness, safety, fairness, robustness, privacy, and machine ethics—while supporting dynamic data generation and adaptive evaluation through three core modules: Metadata Curator, Test Case Builder, and Contextual Variator. Taking TrustGen into action to evaluate the trustworthiness of 39 models reveals four key insights. (1) State-of-the-art GenFMs achieve promising overall trust performance, yet significant limitations remain in specific dimensions such as hallucination resistance, fairness, and privacy preservation. (2) Contrary to prevailing assumptions, open-source models now rival and occasionally surpass proprietary systems in trustworthiness metrics. (3) The trust gap among top-performing models is narrowing, likely due to increased industry convergence on best practices. (4) Trustworthiness is not an isolated property; it interacts complexly with other behaviors, such as helpfulness and ethical decision-making. TrustGen is a transformative step toward standardized, scalable, and actionable trustworthiness evaluation, supporting dynamic assessments across diverse modalities and trust dimensions that evolve alongside the generative AI landscape.

Cite

Text

Huang et al. "TrustGen: A Platform of Dynamic Benchmarking on the Trustworthiness of Generative Foundation Models." International Conference on Learning Representations, 2026.

Markdown

[Huang et al. "TrustGen: A Platform of Dynamic Benchmarking on the Trustworthiness of Generative Foundation Models." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/huang2026iclr-trustgen/)

BibTeX

@inproceedings{huang2026iclr-trustgen,
  title     = {{TrustGen: A Platform of Dynamic Benchmarking on the Trustworthiness of Generative Foundation Models}},
  author    = {Huang, Yue and Gao, Chujie and Wu, Siyuan and Wang, Haoran and Wang, Xiangqi and Ye, Jiayi and Zhou, Yujun and Wang, Yanbo and Shi, Jiawen and Zhang, Qihui and Bao, Han and Liu, Zhaoyi and Li, Yuan and Guan, Tianrui and Wang, Peiran and Zhuang, Haomin and Chen, Dongping and Guo, Kehan and Zou, Andy and Hooi, Bryan and Xiong, Caiming and Stengel-Eskin, Elias and Zhang, Hongyang and Yin, Hongzhi and Zhang, Huan and Yao, Huaxiu and Zhang, Jieyu and Yoon, Jaehong and Shu, Kai and Krishna, Ranjay and Swayamdipta, Swabha and Shi, Weijia and Li, Xiang and Hao, Yuexing and Jia, Zhihao and Li, Zhize and Chen, Xiuying and Tu, Zhengzhong and Hu, Xiyang and Zhou, Tianyi and Zhao, Jieyu and Sun, Lichao and Huang, Furong and Cohen-Sasson, Or and Sattigeri, Prasanna and Reuel, Anka and Lamparth, Max and Zhao, Yue and Dziri, Nouha and Su, Yu and Sun, Huan and Ji, Heng and Xiao, Chaowei and Bansal, Mohit and Chawla, Nitesh V and Pei, Jian and Gao, Jianfeng and Backes, Michael and Yu, Philip S. and Gong, Neil Zhenqiang and Chen, Pin-Yu and Li, Bo and Song, Dawn and Zhang, Xiangliang},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/huang2026iclr-trustgen/}
}