Lu, Zhiwu
50 publications
ICCV
2025
CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval
AAAI
2025
Leveraging Large Vision-Language Model as User Intent-Aware Encoder for Composed Image Retrieval
ICLR
2025
MMRole: A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents
CVPR
2022
COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval
NeurIPS
2022
Fine-Grained Analysis of Stability and Generalization for Modern Meta Learning Algorithms