I2EBench: A Comprehensive Benchmark for Instruction-Based Image Editing
Abstract
Significant progress has been made in the field of Instruction-based Image Editing (IIE). However, evaluating these models poses a significant challenge. A crucial requirement in this field is the establishment of a comprehensive evaluation benchmark for accurately assessing editing results and providing valuable insights for its further development. In response to this need, we propose I2EBench, a comprehensive benchmark designed to automatically evaluate the quality of edited images produced by IIE models from multiple dimensions. I2EBench consists of 2,000+ images for editing, along with 4,000+ corresponding original and diverse instructions. It offers three distinctive characteristics: 1) Comprehensive Evaluation Dimensions: I2EBench comprises 16 evaluation dimensions that cover both high-level and low-level aspects, providing a comprehensive assessment of each IIE model. 2) Human Perception Alignment: To ensure the alignment of our benchmark with human perception, we conducted an extensive user study for each evaluation dimension. 3) Valuable Research Insights: By analyzing the advantages and disadvantages of existing IIE models across the 16 dimensions, we offer valuable research insights to guide future development in the field. We will open-source I2EBench, including all instructions, input images, human annotations, edited images from all evaluated methods, and a simple script for evaluating the results from new IIE models. The code, dataset, and generated images from all IIE models are provided in GitHub: https://github.com/cocoshe/I2EBench.
Cite
Text
Ma et al. "I2EBench: A Comprehensive Benchmark for Instruction-Based Image Editing." Neural Information Processing Systems, 2024. doi:10.52202/079017-1313Markdown
[Ma et al. "I2EBench: A Comprehensive Benchmark for Instruction-Based Image Editing." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/ma2024neurips-i2ebench/) doi:10.52202/079017-1313BibTeX
@inproceedings{ma2024neurips-i2ebench,
title = {{I2EBench: A Comprehensive Benchmark for Instruction-Based Image Editing}},
author = {Ma, Yiwei and Ji, Jiayi and Ye, Ke and Lin, Weihuang and Wang, Zhibin and Zheng, Yonghan and Zhou, Qiang and Sun, Xiaoshuai and Ji, Rongrong},
booktitle = {Neural Information Processing Systems},
year = {2024},
doi = {10.52202/079017-1313},
url = {https://mlanthology.org/neurips/2024/ma2024neurips-i2ebench/}
}