IVEBench: Modern Benchmark Suite for Instruction-Guided Video Editing Assessment

Abstract

Instruction-guided video editing has emerged as a rapidly advancing research direction, offering new opportunities for intuitive content transformation while also posing significant challenges for systematic evaluation. Existing video editing benchmarks fail to support the evaluation of instruction-guided video editing adequately and further suffer from limited source diversity, narrow task coverage and incomplete evaluation metrics. To address above limitations, we introduce IVEBench, a modern benchmark suite specifically designed for instruction-guided video editing assessment. IVEBench comprises a diverse database of 600 high-quality source videos, spanning seven semantic dimensions, and covering video lengths ranging from 32 to 1,024 frames. It further includes 8 categories of editing tasks with 35 subcategories, whose prompts are generated and refined through large language models and expert review. Crucially, IVEBench establishes a three-dimensional evaluation protocol encompassing video quality, instruction compliance and video fidelity, integrating both traditional metrics and multimodal large language model-based assessments. Extensive experiments demonstrate the effectiveness of IVEBench in benchmarking state-of-the-art instruction-guided video editing methods, showing its ability to provide comprehensive and human-aligned evaluation outcomes. All data and code will be made publicly available.

Cite

Text

Chen et al. "IVEBench: Modern Benchmark Suite for Instruction-Guided Video Editing Assessment." International Conference on Learning Representations, 2026.

Markdown

[Chen et al. "IVEBench: Modern Benchmark Suite for Instruction-Guided Video Editing Assessment." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/chen2026iclr-ivebench/)

BibTeX

@inproceedings{chen2026iclr-ivebench,
  title     = {{IVEBench: Modern Benchmark Suite for Instruction-Guided Video Editing Assessment}},
  author    = {Chen, Yinan and Zhang, Jiangning and Hu, Teng and Zeng, Yuxiang and Xue, Zhucun and He, Qingdong and Wang, Chengjie and Liu, Yong and Hu, Xiaobin and Yan, Shuicheng},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/chen2026iclr-ivebench/}
}