Wei, Cong

14 publications

ICCV 2025. Advancing Visual Large Language Model for Multi-Granular Versatile Perception. Wentao Xiang, Haoxian Tan, Yujie Zhong, Cong Wei, Dengjie Li, Yujiu Yang
CVPR 2025. HyperSeg: Hybrid Segmentation Assistant with Fine-Grained Visual Perceiver. Cong Wei, Yujie Zhong, Haoxian Tan, Yong Liu, Jie Hu, Dengjie Li, Zheng Zhao, Yujiu Yang
ICCV 2025. InstructSeg: Unifying Instructed Visual Segmentation with Multi-Modal Large Language Models. Cong Wei, Yujie Zhong, Haoxian Tan, Yingsen Zeng, Yong Liu, Hongfa Wang, Yujiu Yang
NeurIPS 2025. MoCha: Towards Movie-Grade Talking Character Generation. Cong Wei, Bo Sun, Haoyu Ma, Ji Hou, Felix Juefei-Xu, Zecheng He, Xiaoliang Dai, Luxin Zhang, Kunpeng Li, Tingbo Hou, Animesh Sinha, Peter Vajda, Wenhu Chen
ICLR 2025. OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision. Cong Wei, Zheyang Xiong, Weiming Ren, Xeron Du, Ge Zhang, Wenhu Chen
CVPR 2025. VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation. Weiming Ren, Huan Yang, Jie Min, Cong Wei, Wenhu Chen
ICCV 2025. Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers. Weiming Ren, Wentao Ma, Huan Yang, Cong Wei, Ge Zhang, Wenhu Chen
TMLR 2024. AnyV2V: A Tuning-Free Framework for Any Video-to-Video Editing Tasks. Max Ku, Cong Wei, Weiming Ren, Huan Yang, Wenhu Chen
TMLR 2024. ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation. Weiming Ren, Huan Yang, Ge Zhang, Cong Wei, Xinrun Du, Wenhao Huang, Wenhu Chen
CVPR 2024. MMMU: A Massive Multi-Discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI. Xiang Yue, Yuansheng Ni, Kai Zhang, Tianyu Zheng, Ruoqi Liu, Ge Zhang, Samuel Stevens, Dongfu Jiang, Weiming Ren, Yuxuan Sun, Cong Wei, Botao Yu, Ruibin Yuan, Renliang Sun, Ming Yin, Boyuan Zheng, Zhenzhu Yang, Yibo Liu, Wenhao Huang, Huan Sun, Yu Su, Wenhu Chen
TMLR 2024. Mantis: Interleaved Multi-Image Instruction Tuning. Dongfu Jiang, Xuan He, Huaye Zeng, Cong Wei, Max Ku, Qian Liu, Wenhu Chen
ECCV 2024. UniIR: Training and Benchmarking Universal Multimodal Information Retrievers. Cong Wei, Yang Chen, Haonan Chen, Hexiang Hu, Ge Zhang, Jie Fu, Alan Ritter, Wenhu Chen
TMLR 2023. DreamEdit: Subject-Driven Image Editing. Tianle Li, Max Ku, Cong Wei, Wenhu Chen
CVPR 2023. Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers. Cong Wei, Brendan Duke, Ruowei Jiang, Parham Aarabi, Graham W. Taylor, Florian Shkurti