Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives

Cite

Text

Sarto et al. "Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives." International Joint Conference on Artificial Intelligence, 2025. doi:10.24963/IJCAI.2025/1180

Markdown

[Sarto et al. "Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives." International Joint Conference on Artificial Intelligence, 2025.](https://mlanthology.org/ijcai/2025/sarto2025ijcai-image/) doi:10.24963/IJCAI.2025/1180

BibTeX

@inproceedings{sarto2025ijcai-image,
  title     = {{Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives}},
  author    = {Sarto, Sara and Cornia, Marcella and Cucchiara, Rita},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {10632--10640},
  doi       = {10.24963/IJCAI.2025/1180},
  url       = {https://mlanthology.org/ijcai/2025/sarto2025ijcai-image/}
}