Cucchiara, Rita
101 publications
CVPR
2025
Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-Based Visual Question Answering
IJCAI
2025
Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives
CVPR
2025
Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval
ECCVW
2024
Optimizing Resource Consumption in Diffusion Models Through Hallucination Early Detection
NeurIPS
2024
Personalized Instance-Based Navigation Toward User-Specific Objects in Realistic Environments
ECCVW
2024
Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis
CVPR
2024
Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation
WACV
2024
What's Outside the Intersection? Fine-Grained Error Analysis for Semantic Segmentation Beyond IoU
ICCV
2023
Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing
ICCVW
2023
Volumetric Fast Fourier Convolution for Detecting Ink on the Carbonized Herculaneum Papyri
CVPRW
2022
The Unreasonable Effectiveness of CLIP Features for Image Captioning: An Experimental Analysis
CVPRW
2016
DR(eye)VE: A Dataset for Attention-Based Tasks with Applications to Autonomous and Assisted Driving
CVPRW
2012
Understanding Dyadic Interactions Applying Proxemic Theory on Videosurveillance Trajectories
CVPRW
2011
Energy-Efficient Foreground Object Detection on Embedded Smart Cameras by Hardware-Level Operations