Li, Xinhao

14 publications

NeurIPS 2025. GSPN-2: Efficient Parallel Sequence Modeling. Hongjun Wang, Yitong Jiang, Collin McCarthy, David Wehr, Hanrong Ye, Xinhao Li, Ka Chun Cheung, Wonmin Byeon, Jinwei Gu, Ke Chen, Kai Han, Hongxu Yin, Pavlo Molchanov, Jan Kautz, Sifei Liu
ICML 2025. Learning to (Learn at Test Time): RNNs with Expressive Hidden States. Yu Sun, Xinhao Li, Karan Dalal, Jiarui Xu, Arjun Vikram, Genghan Zhang, Yann Dubois, Xinlei Chen, Xiaolong Wang, Sanmi Koyejo, Tatsunori Hashimoto, Carlos Guestrin
NeurIPS 2025. LongVPO: From Anchored Cues to Self-Reasoning for Long-Form Video Preference Optimization. Zhenpeng Huang, Jiaqi Li, Zihan Jia, Xinhao Li, Desen Meng, Lingxue Song, Xi Chen, Liang Li, Limin Wang
CVPR 2025. Online Video Understanding: OVBench and VideoChat-Online. Zhenpeng Huang, Xinhao Li, Jiaqi Li, Jing Wang, Xiangyu Zeng, Cheng Liang, Tao Wu, Xi Chen, Liang Li, Limin Wang
NeurIPS 2025. StreamForest: Efficient Online Video Understanding with Persistent Event Memory. Xiangyu Zeng, Kefan Qiu, Qingyu Zhang, Xinhao Li, Jing Wang, Jiaxin Li, Ziang Yan, Kun Tian, Meng Tian, Xinhai Zhao, Yi Wang, Limin Wang
CVPR 2025. Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment. Ziang Yan, Zhilin Li, Yinan He, Chenting Wang, Kunchang Li, Xinhao Li, Xiangyu Zeng, Zilei Wang, Yali Wang, Yu Qiao, Limin Wang, Yi Wang
ICLR 2025. TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning. Xiangyu Zeng, Kunchang Li, Chenting Wang, Xinhao Li, Tianxiang Jiang, Ziang Yan, Songze Li, Yansong Shi, Zhengrong Yue, Yi Wang, Yali Wang, Yu Qiao, Limin Wang
NeurIPS 2025. VideoChat-R1.5: Visual Test-Time Scaling to Reinforce Multimodal Reasoning by Iterative Perception. Ziang Yan, Yinan He, Xinhao Li, Zhengrong Yue, Xiangyu Zeng, Yali Wang, Yu Qiao, Limin Wang, Yi Wang
ICLR 2024. InternVid: A Large-Scale Video-Text Dataset for Multimodal Understanding and Generation. Yi Wang, Yinan He, Yizhuo Li, Kunchang Li, Jiashuo Yu, Xin Ma, Xinhao Li, Guo Chen, Xinyuan Chen, Yaohui Wang, Ping Luo, Ziwei Liu, Yali Wang, Limin Wang, Yu Qiao
ECCV 2024. InternVideo2: Scaling Foundation Models for Multimodal Video Understanding. Yi Wang, Kunchang Li, Xinhao Li, Jiashuo Yu, Yinan He, Guo Chen, Baoqi Pei, Rongkun Zheng, Jilan Xu, Zun Wang, Yansong Shi, Tianxiang Jiang, SongZe Li, Hongjie Zhang, Yifei Huang, Yu Qiao, Yali Wang, Limin Wang
ECCV 2024. VideoMamba: State Space Model for Efficient Video Understanding. Kunchang Li, Xinhao Li, Yi Wang, Yinan He, Yali Wang, Limin Wang, Yu Qiao
ECCV 2024. ZeroI2V: Zero-Cost Adaptation of Pre-Trained Transformers from Image to Video. Xinhao Li, Yuhan Zhu, Limin Wang
ECCV 2022. Interpretable Open-Set Domain Adaptation via Angular Margin Separation. Xinhao Li, Jingjing Li, Zhekai Du, Lei Zhu, Wen Li
AAAI 2021. MIMOSA: Multi-Constraint Molecule Sampling for Molecule Optimization. Tianfan Fu, Cao Xiao, Xinhao Li, Lucas M. Glass, Jimeng Sun