Baraldi, Lorenzo

40 publications

CVPR 2025 Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-Based Visual Question Answering Federico Cocchi, Nicholas Moratelli, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara
ICLR 2025 Causal Graphical Models for Vision-Language Compositional Understanding Fiorenzo Parascandolo, Nicholas Moratelli, Enver Sangineto, Lorenzo Baraldi, Rita Cucchiara
CVPR 2025 Hyperbolic Safety-Aware Vision-Language Models Tobia Poppi, Tejaswi Kasarla, Pascal Mettes, Lorenzo Baraldi, Rita Cucchiara
ICCV 2025 MissRAG: Addressing the Missing Modality Challenge in Multimodal Large Language Models Vittorio Pipoli, Alessia Saporita, Federico Bolelli, Marcella Cornia, Lorenzo Baraldi, Costantino Grana, Rita Cucchiara, Elisa Ficarra
WACV 2025 Perceive Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries Roberto Amoroso, Gengyuan Zhang, Rajat Koner, Lorenzo Baraldi, Rita Cucchiara, Volker Tresp
CVPR 2025 Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval Davide Caffagni, Sara Sarto, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara
WACV 2025 Semantically Conditioned Prompts for Visual Recognition Under Missing Modality Scenarios Vittorio Pipoli, Federico Bolelli, Sara Sarto, Marcella Cornia, Lorenzo Baraldi, Costantino Grana, Rita Cucchiara, Elisa Ficarra
ICCV 2025 Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation Luca Barsellotti, Lorenzo Bianchi, Nicola Messina, Fabio Carrara, Marcella Cornia, Lorenzo Baraldi, Fabrizio Falchi, Rita Cucchiara
ICCV 2025 What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models Lorenzo Baraldi, Davide Bucciarelli, Federico Betti, Marcella Cornia, Lorenzo Baraldi, Nicu Sebe, Rita Cucchiara
ICCV 2025 What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models Lorenzo Baraldi, Davide Bucciarelli, Federico Betti, Marcella Cornia, Lorenzo Baraldi, Nicu Sebe, Rita Cucchiara
NeurIPS 2025 vHector and HeisenVec: Scalable Vector Graphics Generation Through Large Language Models Leonardo Zini, Elia Frigieri, Sebastiano Aloscari, Lorenzo Baraldi
CVPRW 2024 AIGeN: An Adversarial Approach for Instruction Generation in VLN Niyati Rawal, Roberto Bigazzi, Lorenzo Baraldi, Rita Cucchiara
ECCV 2024 BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues Sara Sarto, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara
ECCV 2024 Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities Lorenzo Baraldi, Federico Cocchi, Marcella Cornia, Lorenzo Baraldi, Alessandro Nicolosi, Rita Cucchiara
ECCV 2024 Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities Lorenzo Baraldi, Federico Cocchi, Marcella Cornia, Lorenzo Baraldi, Alessandro Nicolosi, Rita Cucchiara
WACV 2024 FOSSIL: Free Open-Vocabulary Semantic Segmentation Through Synthetic References Retrieval Luca Barsellotti, Roberto Amoroso, Lorenzo Baraldi, Rita Cucchiara
ECCVW 2024 Optimizing Resource Consumption in Diffusion Models Through Hallucination Early Detection Federico Betti, Lorenzo Baraldi, Lorenzo Baraldi, Rita Cucchiara, Nicu Sebe
ECCVW 2024 Optimizing Resource Consumption in Diffusion Models Through Hallucination Early Detection Federico Betti, Lorenzo Baraldi, Lorenzo Baraldi, Rita Cucchiara, Nicu Sebe
NeurIPS 2024 Personalized Instance-Based Navigation Toward User-Specific Objects in Realistic Environments Luca Barsellotti, Roberto Bigazzi, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara
ECCVW 2024 Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis Davide Bucciarelli, Nicholas Moratelli, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara
ECCV 2024 Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models Samuele Poppi, Tobia Poppi, Federico Cocchi, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara
CVPR 2024 Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation Luca Barsellotti, Roberto Amoroso, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara
WACV 2024 What's Outside the Intersection? Fine-Grained Error Analysis for Semantic Segmentation Beyond IoU Maximilian Bernhard, Roberto Amoroso, Yannic Kindermann, Lorenzo Baraldi, Rita Cucchiara, Volker Tresp, Matthias Schubert
CVPRW 2024 Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs Davide Caffagni, Federico Cocchi, Nicholas Moratelli, Sara Sarto, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara
CVPR 2023 Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation Sara Sarto, Manuele Barraco, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara
ICCV 2023 With a Little Help from Your Own past: Prototypical Memory Networks for Image Captioning Manuele Barraco, Sara Sarto, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara
CVPRW 2022 Dual-Branch Collaborative Transformer for Virtual Try-on Emanuele Fenocchi, Davide Morelli, Marcella Cornia, Lorenzo Baraldi, Fabio Cesari, Rita Cucchiara
CVPRW 2022 The Unreasonable Effectiveness of CLIP Features for Image Captioning: An Experimental Analysis Manuele Barraco, Marcella Cornia, Silvia Cascianelli, Lorenzo Baraldi, Rita Cucchiara
CVPRW 2021 Estimating (and Fixing) the Effect of Face Obfuscation in Video Recognition Matteo Tomei, Lorenzo Baraldi, Simone Bronzin, Rita Cucchiara
CVPRW 2021 Revisiting the Evaluation of Class Activation Mapping for Explainability: A Novel Metric and Experimental Analysis Samuele Poppi, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara
CVPRW 2018 SAM: Pushing the Limits of Saliency Prediction Models Marcella Cornia, Lorenzo Baraldi, Giuseppe Serra, Rita Cucchiara
ECCVW 2018 Towards Cycle-Consistent Models for Text and Image Retrieval Marcella Cornia, Lorenzo Baraldi, Hamed R. Tavakoli, Rita Cucchiara
ECCVW 2018 Visual-Semantic Alignment Across Domains Using a Semi-Supervised Approach Angelo Carraggi, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara
ECCVW 2018 What Was Monet Seeing While Painting? Translating Artworks to Photo-Realistic Images Matteo Tomei, Lorenzo Baraldi, Marcella Cornia, Rita Cucchiara
CVPR 2017 Hierarchical Boundary-Aware Neural Encoder for Video Captioning Lorenzo Baraldi, Costantino Grana, Rita Cucchiara
ECCV 2016 Context Change Detection for an Ultra-Low Power Low-Resolution Ego-Vision Imager Francesco Paci, Lorenzo Baraldi, Giuseppe Serra, Rita Cucchiara, Luca Benini
ECCVW 2016 Context Change Detection for an Ultra-Low Power Low-Resolution Ego-Vision Imager Francesco Paci, Lorenzo Baraldi, Giuseppe Serra, Rita Cucchiara, Luca Benini
ECCV 2016 Multi-Level Net: A Visual Saliency Prediction Model Marcella Cornia, Lorenzo Baraldi, Giuseppe Serra, Rita Cucchiara
ECCVW 2016 Multi-Level Net: A Visual Saliency Prediction Model Marcella Cornia, Lorenzo Baraldi, Giuseppe Serra, Rita Cucchiara
CVPRW 2014 Gesture Recognition in Ego-Centric Videos Using Dense Trajectories and Hand Segmentation Lorenzo Baraldi, Francesco Paci, Giuseppe Serra, Luca Benini, Rita Cucchiara