Sun, Xing

57 publications

ICML 2025 DS-VLM: Diffusion Supervision Vision Language Model Zhen Sun, Yunhang Shen, Jie Li, Xing Sun, Pingyang Dai, Liujuan Cao, Rongrong Ji
ICML 2025 FlexiReID: Adaptive Mixture of Expert for Multi-Modal Person Re-Identification Zhen Sun, Lei Tan, Yunhang Shen, Chengmao Cai, Xing Sun, Pingyang Dai, Liujuan Cao, Rongrong Ji
ICML 2025 Freeze-Omni: A Smart and Low Latency Speech-to-Speech Dialogue Model with Frozen LLM Xiong Wang, Yangze Li, Chaoyou Fu, Yike Zhang, Yunhang Shen, Lei Xie, Ke Li, Xing Sun, Long Ma
NeurIPS 2025 Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models Yulei Qin, Gang Li, Zongyi Li, Zihan Xu, Yuchen Shi, Zhekai Lin, Xiao Cui, Ke Li, Xing Sun
NeurIPS 2025 LTD-Bench: Evaluating Large Language Models by Letting Them Draw Liuhao Lin, Ke Li, Zihan Xu, Yuchen Shi, Yulei Qin, Yan Zhang, Xing Sun, Rongrong Ji
ICLR 2025 Learning Interleaved Image-Text Comprehension in Vision-Language Large Models Chenyu Zhou, Mengdan Zhang, Peixian Chen, Chaoyou Fu, Yunhang Shen, Xiawu Zheng, Xing Sun, Rongrong Ji
NeurIPS 2025 MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models Chaoyou Fu, Peixian Chen, Yunhang Shen, Yulei Qin, Mengdan Zhang, Xu Lin, Jinrui Yang, Xiawu Zheng, Ke Li, Xing Sun, Yunsheng Wu, Rongrong Ji, Caifeng Shan, Ran He
AAAI 2025 Probability-Density-Aware Semi-Supervised Learning Shuyang Liu, Ruiqiu Zheng, Yunhang Shen, Zhou Yu, Ke Li, Xing Sun, Shaohui Lin
ICLR 2025 RocketEval: Efficient Automated LLM Evaluation via Grading Checklist Tianjun Wei, Wei Wen, Ruizhi Qiao, Xing Sun, Jianghong Ma
NeurIPS 2025 TransMLA: Migrating GQA Models to MLA with Full DeepSeek Compatibility and Speedup Fanxu Meng, Pingzhi Tang, Zengwei Yao, Xing Sun, Muhan Zhang
TMLR 2025 Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models Yulei Qin, Yuncheng Yang, Pengcheng Guo, Gang Li, Hang Shao, Yuchen Shi, Zihan Xu, Yun Gu, Ke Li, Xing Sun
NeurIPS 2025 VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction Chaoyou Fu, Haojia Lin, Xiong Wang, YiFan Zhang, Yunhang Shen, Xiaoyu Liu, Haoyu Cao, Zuwei Long, Heting Gao, Ke Li, Long Ma, Xiawu Zheng, Rongrong Ji, Xing Sun, Caifeng Shan, Ran He
NeurIPS 2025 VITA-Audio: Fast Interleaved Audio-Text Token Generation for Efficient Large Speech-Language Model Zuwei Long, Yunhang Shen, Chaoyou Fu, Heting Gao, Lijiang Li, Peixian Chen, Mengdan Zhang, Hang Shao, Jian Li, Jinlong Peng, Haoyu Cao, Ke Li, Rongrong Ji, Xing Sun
CVPR 2025 Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-Modal LLMs in Video Analysis Chaoyou Fu, Yuhan Dai, Yongdong Luo, Lei Li, Shuhuai Ren, Renrui Zhang, Zihan Wang, Chenyu Zhou, Yunhang Shen, Mengdan Zhang, Peixian Chen, Yanwei Li, Shaohui Lin, Sirui Zhao, Ke Li, Tong Xu, Xiawu Zheng, Enhong Chen, Caifeng Shan, Ran He, Xing Sun
NeurIPS 2025 Zooming from Context to Cue: Hierarchical Preference Optimization for Multi-Image MLLMs Xudong Li, Mengdan Zhang, Peixian Chen, Xiawu Zheng, Yan Zhang, Jingyuan Zheng, Yunhang Shen, Ke Li, Chaoyou Fu, Xing Sun, Rongrong Ji
CVPR 2024 A General and Efficient Training for Transformer via Token Expansion Wenxuan Huang, Yunhang Shen, Jiao Xie, Baochang Zhang, Gaoqi He, Ke Li, Xing Sun, Shaohui Lin
CVPR 2024 Aligning and Prompting Everything All at Once for Universal Visual Perception Yunhang Shen, Chaoyou Fu, Peixian Chen, Mengdan Zhang, Ke Li, Xing Sun, Yunsheng Wu, Shaohui Lin, Rongrong Ji
CVPR 2024 Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models Xin Li, Yunfei Wu, Xinghua Jiang, Zhihao Guo, Mingming Gong, Haoyu Cao, Yinsong Liu, Deqiang Jiang, Xing Sun
AAAI 2024 Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation Hao Liu, Xin Li, Mingming Gong, Bing Liu, Yunfei Wu, Deqiang Jiang, Yinsong Liu, Xing Sun
CVPR 2024 HRVDA: High-Resolution Visual Document Assistant Chaohu Liu, Kun Yin, Haoyu Cao, Xinghua Jiang, Xin Li, Yinsong Liu, Deqiang Jiang, Xing Sun, Linli Xu
ECCV 2024 Multimodal Label Relevance Ranking via Reinforcement Learning Taian Guo, Taolin Zhang, Haoqian Wu, Hanjun Li, Ruizhi Qiao, Xing Sun
AAAI 2024 SPD-DDPM: Denoising Diffusion Probabilistic Models in the Symmetric Positive Definite Space Yunchen Li, Zhou Yu, Gaoqi He, Yunhang Shen, Ke Li, Xing Sun, Shaohui Lin
AAAI 2024 SoftCLIP: Softer Cross-Modal Alignment Makes CLIP Stronger Yuting Gao, Jinfeng Liu, Zihan Xu, Tong Wu, Enwei Zhang, Ke Li, Jie Yang, Wei Liu, Xing Sun
AAAI 2024 Visual Hallucination Elevates Speech Recognition Fang Zhang, Yongxin Zhu, Xiangxiang Wang, Huang Chen, Xing Sun, Linli Xu
ICCV 2023 Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration Haoyu Cao, Changcun Bao, Chaohu Liu, Huang Chen, Kun Yin, Hao Liu, Yinsong Liu, Deqiang Jiang, Xing Sun
NeurIPS 2023 CAPro: Webly Supervised Learning with Cross-Modality Aligned Prototypes Yulei Qin, Xingyu Chen, Yunhang Shen, Chaoyou Fu, Yun Gu, Ke Li, Xing Sun, Rongrong Ji
ICCV 2023 Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval Yunquan Zhu, Xinkai Gao, Bo Ke, Ruizhi Qiao, Xing Sun
ICCV 2023 D3G: Exploring Gaussian Prior for Temporal Sentence Grounding with Glance Annotation Hanjun Li, Xiujun Shu, Sunan He, Ruizhi Qiao, Wei Wen, Taian Guo, Bei Gan, Xing Sun
WACV 2023 Graph-Based Self-Learning for Robust Person Re-Identification Yuqiao Xian, Jinrui Yang, Fufu Yu, Jun Zhang, Xing Sun
ICLR 2023 Mitigating Memorization of Noisy Labels via Regularization Between Representations Hao Cheng, Zhaowei Zhu, Xing Sun, Yang Liu
ICLR 2022 AS-MLP: An Axial Shifted MLP Architecture for Vision Dongze Lian, Zehao Yu, Xing Sun, Shenghua Gao
CVPR 2022 DIFNet: Boosting Visual Information Flow for Image Captioning Mingrui Wu, Xuying Zhang, Xiaoshuai Sun, Yiyi Zhou, Chao Chen, Jiaxin Gu, Xing Sun, Rongrong Ji
ECCV 2022 DisCo: Remedying Self-Supervised Learning on Lightweight Models with Distilled Contrastive Learning Yuting Gao, Jia-Xin Zhuang, Shaohui Lin, Hao Cheng, Xing Sun, Ke Li, Chunhua Shen
ECCV 2022 Efficient Decoder-Free Object Detection with Transformers Peixian Chen, Mengdan Zhang, Yunhang Shen, Kekai Sheng, Yuting Gao, Xing Sun, Ke Li, Chunhua Shen
AAAI 2022 Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer Yifan Xu, Zhijie Zhang, Mengdan Zhang, Kekai Sheng, Ke Li, Weiming Dong, Liqing Zhang, Changsheng Xu, Xing Sun
ECCV 2022 PAC-Net: Highlight Your Video via History Preference Modeling Hang Wang, Penghao Zhou, Chong Zhou, Zhao Zhang, Xing Sun
ICML 2022 Self-Supervised Models Are Good Teaching Assistants for Vision Transformers Haiyan Wu, Yuting Gao, Yinqi Zhang, Shaohui Lin, Yuan Xie, Xing Sun, Ke Li
CVPR 2022 Training-Free Transformer Architecture Search Qinqin Zhou, Kekai Sheng, Xiawu Zheng, Ke Li, Xing Sun, Yonghong Tian, Jie Chen, Rongrong Ji
ICCV 2021 Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query Guanyu Cai, Jun Zhang, Xinyang Jiang, Yifei Gong, Lianghua He, Fufu Yu, Pai Peng, Xiaowei Guo, Feiyue Huang, Xing Sun
IJCAI 2021 Dig into Multi-Modal Cues for Video Retrieval with Hierarchical Alignment Wenzhe Wang, Mengdan Zhang, Runnan Chen, Guanyu Cai, Penghao Zhou, Pai Peng, Xiaowei Guo, Jian Wu, Xing Sun
AAAI 2021 Enhancing Unsupervised Video Representation Learning by Decoupling the Scene and the Motion Jinpeng Wang, Yuting Gao, Ke Li, Jianguo Hu, Xinyang Jiang, Xiaowei Guo, Rongrong Ji, Xing Sun
CVPR 2021 Learning 3D Shape Feature for Texture-Insensitive Person Re-Identification Jiaxing Chen, Xinyang Jiang, Fudong Wang, Jun Zhang, Feng Zheng, Xing Sun, Wei-Shi Zheng
ICCV 2021 Learning Canonical View Representation for 3D Shape Recognition with Arbitrary Views Xin Wei, Yifei Gong, Fudong Wang, Xing Sun, Jian Sun
ICCV 2021 Learning to Know Where to See: A Visibility-Aware Approach for Occluded Person Re-Identification Jinrui Yang, Jiawei Zhang, Fufu Yu, Xinyang Jiang, Mengdan Zhang, Xing Sun, Ying-Cong Chen, Wei-Shi Zheng
ICLR 2021 Learning with Instance-Dependent Label Noise: A Sample Sieve Approach Hao Cheng, Zhaowei Zhu, Xingyu Li, Yifei Gong, Xing Sun, Yang Liu
AAAI 2021 One for More: Selecting Generalizable Samples for Generalizable ReID Model Enwei Zhang, Xinyang Jiang, Hao Cheng, Ancong Wu, Fufu Yu, Ke Li, Xiaowei Guo, Feng Zheng, Wei-Shi Zheng, Xing Sun
ICCV 2021 PR-Net: Preference Reasoning for Personalized Video Highlight Detection Runnan Chen, Penghao Zhou, Wenzhe Wang, Nenglun Chen, Pai Peng, Xing Sun, Wenping Wang
CVPR 2021 Removing the Background by Adding the Background: Towards Background Robust Self-Supervised Video Representation Learning Jinpeng Wang, Yuting Gao, Ke Li, Yiqi Lin, Andy J. Ma, Hao Cheng, Pai Peng, Feiyue Huang, Rongrong Ji, Xing Sun
CVPR 2021 Temporal Modulation Network for Controllable Space-Time Video Super-Resolution Gang Xu, Jun Xu, Zhen Li, Liang Wang, Xing Sun, Ming-Ming Cheng
AAAI 2020 Asymmetric Co-Teaching for Unsupervised Cross-Domain Person Re-Identification Fengxiang Yang, Ke Li, Zhun Zhong, Zhiming Luo, Xing Sun, Hao Cheng, Xiaowei Guo, Feiyue Huang, Rongrong Ji, Shaozi Li
ECCV 2020 Do Not Disturb Me: Person Re-Identification Under the Interference of Other Pedestrians Shizhen Zhao, Changxin Gao, Jun Zhang, Hao Cheng, Chuchu Han, Xinyang Jiang, Xiaowei Guo, Wei-Shi Zheng, Nong Sang, Xing Sun
NeurIPS 2020 Pruning Filter in Filter Fanxu Meng, Hao Cheng, Ke Li, Huixiang Luo, Xiaowei Guo, Guangming Lu, Xing Sun
AAAI 2020 Rethinking Temporal Fusion for Video-Based Person Re-Identification on Semantic and Time Aspect Xinyang Jiang, Yifei Gong, Xiaowei Guo, Qize Yang, Feiyue Huang, Wei-Shi Zheng, Feng Zheng, Xing Sun
AAAI 2020 Viewpoint-Aware Loss with Angular Regularization for Person Re-Identification Zhihui Zhu, Xinyang Jiang, Feng Zheng, Xiaowei Guo, Feiyue Huang, Xing Sun, Wei-Shi Zheng
ICCVW 2019 The Seventh Visual Object Tracking VOT2019 Challenge Results Matej Kristan, Amanda Berg, Linyu Zheng, Litu Rout, Luc Van Gool, Luca Bertinetto, Martin Danelljan, Matteo Dunnhofer, Meng Ni, Min Young Kim, Ming Tang, Ming-Hsuan Yang, Abdelrahman Eldesokey, Naveen Paluru, Niki Martinel, Pengfei Xu, Pengfei Zhang, Pengkun Zheng, Pengyu Zhang, Philip H. S. Torr, Qi Zhang, Qiang Wang, Qing Guo, Radu Timofte, Jani Käpylä, Rama Krishna Sai Subrahmanyam Gorthi, Richard M. Everson, Ruize Han, Ruohan Zhang, Shan You, Shao-Chuan Zhao, Shengwei Zhao, Shihu Li, Shikun Li, Shiming Ge, Gustavo Fernández, Shuai Bai, Shuosen Guan, Tengfei Xing, Tianyang Xu, Tianyu Yang, Ting Zhang, Tomás Vojír, Wei Feng, Weiming Hu, Weizhao Wang, Abel Gonzalez-Garcia, Wenjie Tang, Wenjun Zeng, Wenyu Liu, Xi Chen, Xi Qiu, Xiang Bai, Xiao-Jun Wu, Xiaoyun Yang, Xier Chen, Xin Li, Alireza Memarmoghadam, Xing Sun, Xingyu Chen, Xinmei Tian, Xu Tang, Xuefeng Zhu, Yan Huang, Yanan Chen, Yanchao Lian, Yang Gu, Yang Liu, Andong Lu, Yanjie Chen, Yi Zhang, Yinda Xu, Yingming Wang, Yingping Li, Yu Zhou, Yuan Dong, Yufei Xu, Yunhua Zhang, Yunkun Li, Anfeng He, Zeyu Wang, Zhao Luo, Zhaoliang Zhang, Zhenhua Feng, Zhenyu He, Zhichao Song, Zhihao Chen, Zhipeng Zhang, Zhirong Wu, Zhiwei Xiong, Zhongjian Huang, Anton Varfolomieiev, Zhu Teng, Zihan Ni, Antoni B. Chan, Jirí Matas, Ardhendu Shekhar Tripathi, Arnold W. M. Smeulders, Bala Suraj Pedasingu, Bao Xin Chen, Baopeng Zhang, Baoyuan Wu, Bi Li, Bin He, Bin Yan, Bing Bai, Ales Leonardis, Bing Li, Bo Li, Byeong Hak Kim, Chao Ma, Chen Fang, Chen Qian, Cheng Chen, Chenglong Li, Chengquan Zhang, Chi-Yi Tsai, Michael Felsberg, Chong Luo, Christian Micheloni, Chunhui Zhang, Dacheng Tao, Deepak Gupta, Dejia Song, Dong Wang, Efstratios Gavves, Eunu Yi, Fahad Shahbaz Khan, Roman P. Pflugfelder, Fangyi Zhang, Fei Wang, Fei Zhao, George De Ath, Goutam Bhat, Guangqi Chen, Guangting Wang, Guoxuan Li, Hakan Cevikalp, Hao Du, Joni-Kristian Kämäräinen, Haojie Zhao, Hasan Saribas, Ho Min Jung, Hongliang Bai, Hongyuan Yu, Houwen Peng, Huchuan Lu, Hui Li, Jiakun Li, Luka Cehovin Zajc, Jianhua Li, Jianlong Fu, Jie Chen, Jie Gao, Jie Zhao, Jin Tang, Jing Li, Jingjing Wu, Jingtuo Liu, Jinqiao Wang, Ondrej Drbohlav, Jinqing Qi, Jinyue Zhang, John K. Tsotsos, Jong Hyuk Lee, Joost van de Weijer, Josef Kittler, Jun Ha Lee, Junfei Zhuang, Kangkai Zhang, Kangkang Wang, Alan Lukezic, Kenan Dai, Lei Chen, Lei Liu, Leida Guo, Li Zhang, Liang Wang, Liangliang Wang, Lichao Zhang, Lijun Wang, Lijun Zhou
JMLR 2008 On the Size and Recovery of Submatrices of Ones in a Random Binary Matrix Xing Sun, Andrew B. Nobel
COLT 2006 Significance and Recovery of Block Structures in Binary Matrices with Noise Xing Sun, Andrew B. Nobel