Zheng, Yun
22 publications
NeurIPS
2025
CAPability: A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness
ICCV
2025
DynImg: Key Frames with Visual Prompts Are Good Representation for Multi-Modal Video Understanding
CVPR
2025
Hybrid-Level Instruction Injection for Video Token Compression in Multi-Modal Large Language Models
AAAI
2025
Orchestrating the Symphony of Prompt Distribution Learning for Human-Object Interaction Detection
NeurIPS
2025
UFO: A Unified Approach to Fine-Grained Visual Perception via Open-Ended Language Interface
NeurIPS
2023
Dual Mean-Teacher: An Unbiased Semi-Supervised Framework for Audio-Visual Source Localization