Duan, Haodong

25 publications

ICCV 2025 Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLMs Xinyu Fang, Zhijian Chen, Kai Lan, Lixin Ma, Shengyuan Ding, Yingji Liang, Xiangyu Zhao, Farong Wen, Zicheng Zhang, Guofeng Zhang, Haodong Duan, Kai Chen, Dahua Lin
NeurIPS 2025 Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing Xiangyu Zhao, Peiyuan Zhang, Kexian Tang, Xiaorong Zhu, Hao Li, Wenhao Chai, Zicheng Zhang, Renqiu Xia, Guangtao Zhai, Junchi Yan, Hua Yang, Xue Yang, Haodong Duan
CVPR 2025 Image Quality Assessment: From Human to Machine Preference Chunyi Li, Yuan Tian, Xiaoyue Ling, Zicheng Zhang, Haodong Duan, Haoning Wu, Ziheng Jia, Xiaohong Liu, Xiongkuo Min, Guo Lu, Weisi Lin, Guangtao Zhai
ICCV 2025 Information Density Principle for MLLM Benchmarks Chunyi Li, Xiaozhe Li, Zicheng Zhang, Yuan Tian, Ziheng Jia, Xiaohong Liu, Xiongkuo Min, Jia Wang, Haodong Duan, Kai Chen, Guangtao Zhai
ICLR 2025 MIA-DPO: Multi-Image Augmented Direct Preference Optimization for Large Vision-Language Models Ziyu Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Haodong Duan, Conghui He, Yuanjun Xiong, Dahua Lin, Jiaqi Wang
ICCV 2025 MM-IFEngine: Towards Multimodal Instruction Following Shengyuan Ding, Shenxi Wu, Xiangyu Zhao, Yuhang Zang, Haodong Duan, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Dahua Lin, Jiaqi Wang
TMLR 2025 NeedleBench: Evaluating LLM Retrieval and Reasoning Across Varying Information Densities Mo Li, Songyang Zhang, Taolin Zhang, Haodong Duan, Yunxin Liu, Kai Chen
CVPR 2025 OVO-Bench: How Far Is Your Video-LLMs from Real-World Online Video Understanding? Junbo Niu, Yifei Li, Ziyang Miao, Chunjiang Ge, Yuanhang Zhou, Qihao He, Xiaoyi Dong, Haodong Duan, Shuangrui Ding, Rui Qian, Pan Zhang, Yuhang Zang, Yuhang Cao, Conghui He, Jiaqi Wang
ICML 2025 VideoRoPE: What Makes for Good Video Rotary Position Embedding? Xilin Wei, Xiaoran Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Jian Tong, Haodong Duan, Qipeng Guo, Jiaqi Wang, Xipeng Qiu, Dahua Lin
ICCV 2025 Visual-RFT: Visual Reinforcement Fine-Tuning Ziyu Liu, Zeyi Sun, Yuhang Zang, Xiaoyi Dong, Yuhang Cao, Haodong Duan, Dahua Lin, Jiaqi Wang
NeurIPS 2024 Are We on the Right Way for Evaluating Large Vision-Language Models? Lin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Jiaqi Wang, Yu Qiao, Dahua Lin, Feng Zhao
NeurIPS 2024 GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI Pengcheng Chen, Jin Ye, Guoan Wang, Yanjun Li, Zhongying Deng, Wei Li, Tianbin Li, Haodong Duan, Ziyan Huang, Yanzhou Su, Benyou Wang, Shaoting Zhang, Bin Fu, Jianfei Cai, Bohan Zhuang, Eric J Seibel, Yu Qiao, Junjun He
NeurIPS 2024 InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang
ECCV 2024 MMBench: Is Your Multi-Modal Model an All-Around Player? Yuan Liu, Haodong Duan, Yuanhan Zhang, Bo Li, Songyang Zhang, Wangbo Zhao, Yike Yuan, Jiaqi Wang, Conghui He, Ziwei Liu, Kai Chen, Dahua Lin
NeurIPS 2024 MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding Xinyu Fang, Kangrui Mao, Haodong Duan, Xiangyu Zhao, Yining Li, Dahua Lin, Kai Chen
NeurIPS 2024 Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs Yuxuan Qiao, Haodong Duan, Xinyu Fang, Junming Yang, Lin Chen, Songyang Zhang, Jiaqi Wang, Dahua Lin, Kai Chen
NeurIPS 2024 ShareGPT4Video: Improving Video Understanding and Generation with Better Captions Lin Chen, Xilin Wei, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Bin Lin, Zhenyu Tang, Li Yuan, Yu Qiao, Dahua Lin, Feng Zhao, Jiaqi Wang
NeurIPS 2023 JourneyDB: A Benchmark for Generative Image Understanding Keqiang Sun, Junting Pan, Yuying Ge, Hao Li, Haodong Duan, Xiaoshi Wu, Renrui Zhang, Aojun Zhou, Zipeng Qin, Yi Wang, Jifeng Dai, Yu Qiao, Limin Wang, Hongsheng Li
AAAI 2023 Self-Supervised Action Representation Learning from Partial Spatio-Temporal Skeleton Sequences Yujie Zhou, Haodong Duan, Anyi Rao, Bing Su, Jiaqi Wang
ICCV 2023 SkeleTR: Towards Skeleton-Based Action Recognition in the Wild Haodong Duan, Mingze Xu, Bing Shuai, Davide Modolo, Zhuowen Tu, Joseph Tighe, Alessandro Bergamo
ECCVW 2022 Mitigating Representation Bias in Action Recognition: Algorithms and Benchmarks Haodong Duan, Yue Zhao, Kai Chen, Yuanjun Xiong, Dahua Lin
CVPR 2022 OCSampler: Compressing Videos to One Clip with Single-Step Sampling Jintao Lin, Haodong Duan, Kai Chen, Dahua Lin, Limin Wang
CVPR 2022 Revisiting Skeleton-Based Action Recognition Haodong Duan, Yue Zhao, Kai Chen, Dahua Lin, Bo Dai
CVPR 2022 TransRank: Self-Supervised Video Representation Learning via Ranking-Based Transformation Recognition Haodong Duan, Nanxuan Zhao, Kai Chen, Dahua Lin
ECCV 2020 Omni-Sourced Webly-Supervised Learning for Video Recognition Haodong Duan, Yue Zhao, Yuanjun Xiong, Wentao Liu, Dahua Lin