G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o

Cite

Text

Tong et al. "G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I7.32798

Markdown

[Tong et al. "G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/tong2025aaai-g/) doi:10.1609/AAAI.V39I7.32798

BibTeX

@inproceedings{tong2025aaai-g,
  title     = {{G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o}},
  author    = {Tong, Tony Cheng and He, Sirui and Shao, Zhiwen and Yeung, Dit-Yan},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {7419-7427},
  doi       = {10.1609/AAAI.V39I7.32798},
  url       = {https://mlanthology.org/aaai/2025/tong2025aaai-g/}
}