Behind the Magic, MERLIM: Multi-Modal Evaluation Benchmark for Large Image-Language Models

Cite

Text

Villa et al. "Behind the Magic, MERLIM: Multi-Modal Evaluation Benchmark for Large Image-Language Models." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.

Markdown

[Villa et al. "Behind the Magic, MERLIM: Multi-Modal Evaluation Benchmark for Large Image-Language Models." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.](https://mlanthology.org/cvprw/2025/villa2025cvprw-behind/)

BibTeX

@inproceedings{villa2025cvprw-behind,
  title     = {{Behind the Magic, MERLIM: Multi-Modal Evaluation Benchmark for Large Image-Language Models}},
  author    = {Villa, Andrés and Alcázar, Juan León and Soto, Alvaro and Ghanem, Bernard},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2025},
  pages     = {492--502},
  url       = {https://mlanthology.org/cvprw/2025/villa2025cvprw-behind/}
}