A-Bench: Are LMMs Masters at Evaluating AI-Generated Images?

Abstract

How to accurately and efficiently assess AI-generated images (AIGIs) remains a critical challenge for generative models. Given the high costs and extensive time commitments required for user studies, many researchers have turned towards employing large multi-modal models (LMMs) as AIGI evaluators, the precision and validity of which are still questionable. Furthermore, traditional benchmarks often utilize mostly natural-captured content rather than AIGIs to test the abilities of LMMs, leading to a noticeable gap for AIGIs. Therefore, we introduce **A-Bench** in this paper, a benchmark designed to diagnose *whether LMMs are masters at evaluating AIGIs*. Specifically, **A-Bench** is organized under two key principles: 1) Emphasizing both high-level semantic understanding and low-level visual quality perception to address the intricate demands of AIGIs. 2) Various generative models are utilized for AIGI creation, and various LMMs are employed for evaluation, which ensures a comprehensive validation scope. Ultimately, 2,864 AIGIs from 16 text-to-image models are sampled, each paired with question-answers annotated by human experts. We hope that **A-Bench** will significantly enhance the evaluation process and promote the generation quality for AIGIs.

Cite

Text

Zhang et al. "A-Bench: Are LMMs Masters at Evaluating AI-Generated Images?." International Conference on Learning Representations, 2025.

Markdown

[Zhang et al. "A-Bench: Are LMMs Masters at Evaluating AI-Generated Images?." International Conference on Learning Representations, 2025.](https://mlanthology.org/iclr/2025/zhang2025iclr-abench/)

BibTeX

@inproceedings{zhang2025iclr-abench,
  title     = {{A-Bench: Are LMMs Masters at Evaluating AI-Generated Images?}},
  author    = {Zhang, Zicheng and Wu, Haoning and Li, Chunyi and Zhou, Yingjie and Sun, Wei and Min, Xiongkuo and Chen, Zijian and Liu, Xiaohong and Lin, Weisi and Zhai, Guangtao},
  booktitle = {International Conference on Learning Representations},
  year      = {2025},
  url       = {https://mlanthology.org/iclr/2025/zhang2025iclr-abench/}
}