Shu, Yan

6 publications

CVPR 2025 MLVU: Benchmarking Multi-Task Long Video Understanding Junjie Zhou, Yan Shu, Bo Zhao, Boya Wu, Zhengyang Liang, Shitao Xiao, Minghao Qin, Xi Yang, Yongping Xiong, Bo Zhang, Tiejun Huang, Zheng Liu
CVPR 2025 Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding Yan Shu, Zheng Liu, Peitian Zhang, Minghao Qin, Junjie Zhou, Zhengyang Liang, Tiejun Huang, Bo Zhao
NeurIPS 2025 When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding Yan Shu, Hangui Lin, Yexin Liu, Yan Zhang, Gangyan Zeng, Yan Li, Yu Zhou, Ser-Nam Lim, Harry Yang, Nicu Sebe
NeurIPS 2024 TextCtrl: Diffusion-Based Scene Text Editing with Prior Guidance Control Weichao Zeng, Yan Shu, Zhenhang Li, Dongbao Yang, Yu Zhou
ICCV 2021 Condensing a Sequence to One Informative Frame for Video Recognition Zhaofan Qiu, Ting Yao, Yan Shu, Chong-Wah Ngo, Tao Mei
AISTATS 2020 A Characterization of Mean Squared Error for Estimator with Bagging Martin Mihelich, Charles Dognin, Yan Shu, Michael Blot