Li, Kunchang

22 publications

ICLR 2025 Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel Zun Wang, Jialu Li, Yicong Hong, Songze Li, Kunchang Li, Shoubin Yu, Yi Wang, Yu Qiao, Yali Wang, Mohit Bansal, Limin Wang
ICCV 2025 Make Your Training Flexible: Towards Deployment-Efficient Video Models Chenting Wang, Kunchang Li, Tianxiang Jiang, Xiangyu Zeng, Yi Wang, Limin Wang
AAAI 2025 Muses: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration Yanbo Ding, Shaobin Zhuang, Kunchang Li, Zhengrong Yue, Yu Qiao, Yali Wang
CVPR 2025 Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment Ziang Yan, Zhilin Li, Yinan He, Chenting Wang, Kunchang Li, Xinhao Li, Xiangyu Zeng, Zilei Wang, Yali Wang, Yu Qiao, Limin Wang, Yi Wang
ICML 2025 TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision Shaobin Zhuang, Yiwei Guo, Yanbo Ding, Kunchang Li, Xinyuan Chen, Yaohui Wang, Fangyikang Wang, Ying Zhang, Chen Li, Yali Wang
ICLR 2025 TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning Xiangyu Zeng, Kunchang Li, Chenting Wang, Xinhao Li, Tianxiang Jiang, Ziang Yan, Songze Li, Yansong Shi, Zhengrong Yue, Yi Wang, Yali Wang, Yu Qiao, Limin Wang
CVPR 2025 V-Stylist: Video Stylization via Collaboration and Reflection of MLLM Agents Zhengrong Yue, Shaobin Zhuang, Kunchang Li, Yanbo Ding, Yali Wang
ICLR 2024 InternVid: A Large-Scale Video-Text Dataset for Multimodal Understanding and Generation Yi Wang, Yinan He, Yizhuo Li, Kunchang Li, Jiashuo Yu, Xin Ma, Xinhao Li, Guo Chen, Xinyuan Chen, Yaohui Wang, Ping Luo, Ziwei Liu, Yali Wang, Limin Wang, Yu Qiao
ECCV 2024 InternVideo2: Scaling Foundation Models for Multimodal Video Understanding Yi Wang, Kunchang Li, Xinhao Li, Jiashuo Yu, Yinan He, Guo Chen, Baoqi Pei, Rongkun Zheng, Jilan Xu, Zun Wang, Yansong Shi, Tianxiang Jiang, SongZe Li, Hongjie Zhang, Yifei Huang, Yu Qiao, Yali Wang, Limin Wang
CVPR 2024 MVBench: A Comprehensive Multi-Modal Video Understanding Benchmark Kunchang Li, Yali Wang, Yinan He, Yizhuo Li, Yi Wang, Yi Liu, Zun Wang, Jilan Xu, Guo Chen, Ping Luo, Limin Wang, Yu Qiao
NeurIPS 2024 TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration Yiwei Guo, Shaobin Zhuang, Kunchang Li, Yu Qiao, Yali Wang
ECCV 2024 VideoMamba: State Space Model for Efficient Video Understanding Kunchang Li, Xinhao Li, Yi Wang, Yinan He, Yali Wang, Limin Wang, Yu Qiao
CVPR 2024 Vlogger: Make Your Dream a Vlog Shaobin Zhuang, Kunchang Li, Xinyuan Chen, Yaohui Wang, Ziwei Liu, Yu Qiao, Yali Wang
ICCV 2023 UniFormerV2: Unlocking the Potential of Image ViTs for Video Understanding Kunchang Li, Yali Wang, Yinan He, Yizhuo Li, Yi Wang, Limin Wang, Yu Qiao
ICCV 2023 Unmasked Teacher: Towards Training-Efficient Video Foundation Models Kunchang Li, Yali Wang, Yizhuo Li, Yi Wang, Yinan He, Limin Wang, Yu Qiao
ECCV 2022 MorphMLP: An Efficient MLP-like Backbone for Spatial-Temporal Representation Learning David Junhao Zhang, Kunchang Li, Yali Wang, Yunpeng Chen, Shashwat Chandra, Yu Qiao, Luoqi Liu, Mike Zheng Shou
CVPR 2022 PointCLIP: Point Cloud Understanding by CLIP Renrui Zhang, Ziyu Guo, Wei Zhang, Kunchang Li, Xupeng Miao, Bin Cui, Yu Qiao, Peng Gao, Hongsheng Li
WACV 2022 Pose-Guided Generative Adversarial Net for Novel View Action Synthesis Xianhang Li, Junhao Zhang, Kunchang Li, Shruti Vyas, Yogesh S. Rawat
ECCV 2022 Self-Slimmed Vision Transformer Zhuofan Zong, Kunchang Li, Guanglu Song, Yali Wang, Yu Qiao, Biao Leng, Yu Liu
ECCV 2022 Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification Renrui Zhang, Wei Zhang, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li
ICLR 2022 UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning Kunchang Li, Yali Wang, Gao Peng, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao
ICLR 2021 CT-Net: Channel Tensorization Network for Video Classification Kunchang Li, Xianhang Li, Yali Wang, Jun Wang, Yu Qiao