Soto, Alvaro
18 publications
CVPRW
2025
Behind the Magic, MERLIM: Multi-Modal Evaluation Benchmark for Large Image-Language Models
WACV
2019
Interpretable Visual Question Answering by Visual Grounding from Attention Supervision Mining
IJCAI
2017
How a General-Purpose Commonsense Ontology Can Improve Performance of Learning-Based Image Retrieval
WACV
2015
Visual Recognition to Access and Analyze People Density and Flow Patterns in Indoor Environments