Baraldi, Lorenzo
40 publications
CVPR
2025
Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-Based Visual Question Answering
CVPR
2025
Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval
NeurIPS
2025
vHector and HeisenVec: Scalable Vector Graphics Generation Through Large Language Models
ECCVW
2024
Optimizing Resource Consumption in Diffusion Models Through Hallucination Early Detection
NeurIPS
2024
Personalized Instance-Based Navigation Toward User-Specific Objects in Realistic Environments
ECCVW
2024
Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis
CVPR
2024
Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation
WACV
2024
What's Outside the Intersection? Fine-Grained Error Analysis for Semantic Segmentation Beyond IoU