Zeng, Xiangyu

14 publications

ICLR 2026 Balancing the Experts: Unlocking LoRA-MoE for GRPO via Mechanism-Aware Rewards Changlian Ma, Zizheng Huang, Xiangyu Zeng, Yi Wang, Cheng Liang, Kun Tian, Xinhai Zhao, Limin Wang
ICLR 2026 RIVER: A Real-Time Interaction Benchmark for Video LLMs Yansong Shi, Qingsong Zhao, Tianxiang Jiang, Xiangyu Zeng, Yi Wang, Limin Wang
ICLR 2026 UniFlow: A Unified Pixel Flow Tokenizer for Visual Understanding and Generation Zhengrong Yue, Haiyu Zhang, Xiangyu Zeng, Boyu Chen, Chenting Wang, Shaobin Zhuang, Lu Dong, Yi Wang, Limin Wang, Yali Wang
ICLR 2026 VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling Xinhao Li, Yi Wang, Jiashuo Yu, Xiangyu Zeng, Yuhan Zhu, Haian Huang, Jianfei Gao, Kunchang Li, Yinan He, Chenting Wang, Yu Qiao, Yali Wang, Limin Wang
IJCAI 2025 Deduction with Induction: Combining Knowledge Discovery and Reasoning for Interpretable Deep Reinforcement Learning Haodi Zhang, Xiangyu Zeng, Junyang Chen, Yuanfeng Song, Rui Mao, Fangzhen Lin
ICCV 2025 Make Your Training Flexible: Towards Deployment-Efficient Video Models Chenting Wang, Kunchang Li, Tianxiang Jiang, Xiangyu Zeng, Yi Wang, Limin Wang
CVPR 2025 Online Video Understanding: OVBench and VideoChat-Online Zhenpeng Huang, Xinhao Li, Jiaqi Li, Jing Wang, Xiangyu Zeng, Cheng Liang, Tao Wu, Xi Chen, Liang Li, Limin Wang
NeurIPS 2025 StreamForest: Efficient Online Video Understanding with Persistent Event Memory Xiangyu Zeng, Kefan Qiu, Qingyu Zhang, Xinhao Li, Jing Wang, Jiaxin Li, Ziang Yan, Kun Tian, Meng Tian, Xinhai Zhao, Yi Wang, Limin Wang
CVPR 2025 Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment Ziang Yan, Zhilin Li, Yinan He, Chenting Wang, Kunchang Li, Xinhao Li, Xiangyu Zeng, Zilei Wang, Yali Wang, Yu Qiao, Limin Wang, Yi Wang
ICLR 2025 TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning Xiangyu Zeng, Kunchang Li, Chenting Wang, Xinhao Li, Tianxiang Jiang, Ziang Yan, Songze Li, Yansong Shi, Zhengrong Yue, Yi Wang, Yali Wang, Yu Qiao, Limin Wang
NeurIPS 2025 VideoChat-R1.5: Visual Test-Time Scaling to Reinforce Multimodal Reasoning by Iterative Perception Ziang Yan, Yinan He, Xinhao Li, Zhengrong Yue, Xiangyu Zeng, Yali Wang, Yu Qiao, Limin Wang, Yi Wang
ICLR 2024 A Framework for Inference Inspired by Human Memory Mechanisms Xiangyu Zeng, Jie Lin, Piao Hu, Ruizheng Huang, Zhicheng Zhang
AISTATS 2024 SDMTR: A Brain-Inspired Transformer for Relation Inference Xiangyu Zeng, Jie Lin, Piao Hu, Zhihao Li, Tianxi Huang
ICCV 2021 Single View Physical Distance Estimation Using Human Pose Xiaohan Fei, Henry Wang, Lin Lee Cheong, Xiangyu Zeng, Meng Wang, Joseph Tighe