Yan, Ming

46 publications

CVPR 2025 AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization Yiyang Du, Xiaochen Wang, Chi Chen, Jiabo Ye, Yiru Wang, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Zhifang Sui, Maosong Sun, Yang Liu
CVPR 2025 ClimbingCap: Multi-Modal Dataset and Method for Rock Climbing in World Coordinate Ming Yan, Xincheng Lin, Yuhua Luo, Shuqi Fan, Yudi Dai, Qixin Zhong, Lincai Zhong, Yuexin Ma, Lan Xu, Chenglu Wen, Siqi Shen, Cheng Wang
ICLR 2025 Endowing Visual Reprogramming with Adversarial Robustness Shengjie Zhou, Xin Cheng, Haiyang Xu, Ming Yan, Tao Xiang, Feng Liu, Lei Feng
ICML 2025 Exploiting Presentative Feature Distributions for Parameter-Efficient Continual Learning of Large Language Models Xin Cheng, Jiabo Ye, Haiyang Xu, Ming Yan, Ji Zhang, Feng Liu, Fei Huang, Lei Feng
ICLRW 2025 Interpretable Steering of Large Language Models with Feature Guided Activation Additions Samuel Soo, Wesley Teng, Chandrasekaran Balaganesh, Tan Guoxian, Ming Yan
NeurIPS 2025 Look Before You Leap: A GUI-Critic-R1 Model for Pre-Operative Error Diagnosis in GUI Automation Yuyang Wanyan, Xi Zhang, Haiyang Xu, Haowei Liu, Junyang Wang, Jiabo Ye, Yutong Kou, Ming Yan, Fei Huang, Xiaoshan Yang, Weiming Dong, Changsheng Xu
ICLRW 2025 PC-Agent: A Hierarchical Agentic Framework for Complex Task Automation on PC Haowei Liu, Xi Zhang, Haiyang Xu, Yuyang Wanyan, Junyang Wang, Ming Yan, Ji Zhang, Chunfeng Yuan, Changsheng Xu, Weiming Hu, Fei Huang
AAAI 2025 RoDA: Robust Domain Alignment for Cross-Domain Retrieval Against Label Noise Ziniu Yin, Yanglin Feng, Ming Yan, Xiaomin Song, Dezhong Peng, Xu Wang
CVPR 2025 SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization Hongrui Jia, Chaoya Jiang, Haiyang Xu, Wei Ye, Mengfan Dong, Ming Yan, Ji Zhang, Fei Huang, Shikun Zhang
ECML-PKDD 2025 Target-Adaptive Structure-Semantic Consistency for Unsupervised Graph Domain Adaptation Yan Zou, Yongzheng Lu, Na Li, Xiatian Zhu, Lan Du, Ming Yan, Ying Ma
ICML 2025 Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning Lang Feng, Weihao Tan, Zhiyi Lyu, Longtao Zheng, Haiyang Xu, Ming Yan, Fei Huang, Bo An
NeurIPS 2025 VLM-R³: Region Recognition, Reasoning, and Refinement for Enhanced Multimodal Chain-of-Thought Chaoya Jiang, Yongrui Heng, Wei Ye, Haiyang Xu, Ming Yan, Ji Zhang, Fei Huang, Shikun Zhang
NeurIPS 2025 WritingBench: A Comprehensive Benchmark for Generative Writing Yuning Wu, Jiahao Mei, Ming Yan, Chenliang Li, Shaopeng Lai, Yuran Ren, Wang Zijia, Ji Zhang, Mengyue Wu, Qin Jin, Fei Huang
ICLR 2025 mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models Jiabo Ye, Haiyang Xu, Haowei Liu, Anwen Hu, Ming Yan, Qi Qian, Ji Zhang, Fei Huang, Jingren Zhou
IJCAI 2024 Breaking Barriers of System Heterogeneity: Straggler-Tolerant Multimodal Federated Learning via Knowledge Distillation Jinqian Chen, Haoyu Tang, Junhao Cheng, Ming Yan, Ji Zhang, Mingzhu Xu, Yupeng Hu, Liqiang Nie
AAAI 2024 DiDA: Disambiguated Domain Alignment for Cross-Domain Retrieval with Partial Labels Haoran Liu, Ying Ma, Ming Yan, Yingke Chen, Dezhong Peng, Xu Wang
ACML 2024 FTP: A Human Pose Estimation Method Integrating Temporal and Fine-Grained Feature Fusion Shuqiang Cai, Chennan Ma, Xin Wang, Li Lin, Ming Yan, Xincheng Lin, Shuqi Fan, Siqi Shen
CVPR 2024 Hallucination Augmented Contrastive Learning for Multimodal Large Language Model Chaoya Jiang, Haiyang Xu, Mengfan Dong, Jiaxing Chen, Wei Ye, Ming Yan, Qinghao Ye, Ji Zhang, Fei Huang, Shikun Zhang
NeurIPS 2024 MaVEn: An Effective Multi-Granularity Hybrid Visual Encoding Framework for Multimodal Large Language Model Chaoya Jiang, Hongrui Jia, Haiyang Xu, Wei Ye, Mengfan Dong, Ming Yan, Ji Zhang, Fei Huang, Shikun Zhang
NeurIPS 2024 Mobile-Agent-V2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration Junyang Wang, Haiyang Xu, Haitao Jia, Xi Zhang, Ming Yan, Weizhou Shen, Ji Zhang, Fei Huang, Jitao Sang
ICLRW 2024 Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception Junyang Wang, Haiyang Xu, Jiabo Ye, Ming Yan, Weizhou Shen, Ji Zhang, Fei Huang, Jitao Sang
CVPR 2024 RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method Ming Yan, Yan Zhang, Shuqiang Cai, Shuqi Fan, Xincheng Lin, Yudi Dai, Siqi Shen, Chenglu Wen, Lan Xu, Yuexin Ma, Cheng Wang
AAAI 2024 TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-Training Chaoya Jiang, Wei Ye, Haiyang Xu, Qinghao Ye, Ming Yan, Ji Zhang, Shikun Zhang
CVPR 2024 mPLUG-Owl2: Revolutionizing Multi-Modal Large Language Model with Modality Collaboration Qinghao Ye, Haiyang Xu, Jiabo Ye, Ming Yan, Anwen Hu, Haowei Liu, Qi Qian, Ji Zhang, Fei Huang
ICCV 2023 BUS: Efficient and Effective Vision-Language Pre-Training with Bottom-up Patch Summarization Chaoya Jiang, Haiyang Xu, Wei Ye, Qinghao Ye, Chenliang Li, Ming Yan, Bin Bi, Shikun Zhang, Fei Huang, Songfang Huang
CVPR 2023 CIMI4D: A Large Multimodal Climbing Motion Dataset Under Human-Scene Interactions Ming Yan, Xin Wang, Yudi Dai, Siqi Shen, Chenglu Wen, Lan Xu, Yuexin Ma, Cheng Wang
AAAI 2023 Correspondence-Free Domain Alignment for Unsupervised Cross-Domain Image Retrieval Xu Wang, Dezhong Peng, Ming Yan, Peng Hu
IJCAI 2023 From Association to Generation: Text-Only Captioning by Unsupervised Cross-Modal Mapping Junyang Wang, Ming Yan, Yi Zhang, Jitao Sang
ICCV 2023 HiTeA: Hierarchical Temporal-Aware Video-Language Pre-Training Qinghao Ye, Guohai Xu, Ming Yan, Haiyang Xu, Qi Qian, Ji Zhang, Fei Huang
ICCV 2023 Improved Visual Fine-Tuning with Natural Language Supervision Junyang Wang, Yuanhong Xu, Juhua Hu, Ming Yan, Jitao Sang, Qi Qian
ICCV 2023 Learning Trajectory-Word Alignments for Video-Language Tasks Xu Yang, Zhangzikang Li, Haiyang Xu, Hanwang Zhang, Qinghao Ye, Chenliang Li, Ming Yan, Yu Zhang, Fei Huang, Songfang Huang
ICML 2023 mPLUG-2: A Modularized Multi-Modal Foundation Model Across Text, Image and Video Haiyang Xu, Qinghao Ye, Ming Yan, Yaya Shi, Jiabo Ye, Yuanhong Xu, Chenliang Li, Bin Bi, Qi Qian, Wei Wang, Guohai Xu, Ji Zhang, Songfang Huang, Fei Huang, Jingren Zhou
NeurIPS 2022 Communication-Efficient Topologies for Decentralized Learning with $O(1)$ Consensus Rate Zhuoqing Song, Weijian Li, Kexin Jin, Lei Shi, Ming Yan, Wotao Yin, Kun Yuan
IJCAI 2022 DictBERT: Dictionary Description Knowledge Enhanced Language Model Pre-Training via Contrastive Learning Qianglong Chen, Feng-Lin Li, Guohai Xu, Ming Yan, Ji Zhang, Yin Zhang
NeurIPS 2022 FedRolex: Model-Heterogeneous Federated Learning with Rolling Sub-Model Extraction Samiul Alam, Luyang Liu, Ming Yan, Mi Zhang
CVPR 2022 Shifting More Attention to Visual Backbone: Query-Modulated Refinement Networks for End-to-End Visual Grounding Jiabo Ye, Junfeng Tian, Ming Yan, Xiaoshan Yang, Xuwu Wang, Ji Zhang, Liang He, Xin Lin
AAAI 2021 A Unified Pretraining Framework for Passage Ranking and Expansion Ming Yan, Chenliang Li, Bin Bi, Wei Wang, Songfang Huang
ICML 2021 Elastic Graph Neural Networks Xiaorui Liu, Wei Jin, Yao Ma, Yaxin Li, Hua Liu, Yiqi Wang, Ming Yan, Jiliang Tang
NeurIPS 2021 ErrorCompensatedX: Error Compensation for Variance Reduced Algorithms Hanlin Tang, Yao Li, Ji Liu, Ming Yan
ICLR 2021 Linear Convergent Decentralized Optimization with Compression Xiaorui Liu, Yao Li, Rongrong Wang, Jiliang Tang, Ming Yan
AISTATS 2020 A Double Residual Compression Algorithm for Efficient Distributed Learning Xiaorui Liu, Yao Li, Jiliang Tang, Ming Yan
AAAI 2020 Generating Well-Formed Answers by Machine Reading with Stochastic Selector Networks Bin Bi, Chen Wu, Ming Yan, Wei Wang, Jiangnan Xia, Chenliang Li
ICLR 2020 StructBERT: Incorporating Language Structures into Pre-Training for Deep Language Understanding Wei Wang, Bin Bi, Ming Yan, Chen Wu, Zuyi Bao, Jiangnan Xia, Liwei Peng, Luo Si
AAAI 2019 A Deep Cascade Model for Multi-Document Reading Comprehension Ming Yan, Jiangnan Xia, Chen Wu, Bin Bi, Zhongzhou Zhao, Ji Zhang, Luo Si, Rui Wang, Wei Wang, Haiqing Chen
NeurIPS 2019 Manifold Denoising by Nonlinear Robust Principal Component Analysis He Lyu, Ningyu Sha, Shuyang Qin, Ming Yan, Yuying Xie, Rongrong Wang
ICML 2018 $D^2$: Decentralized Training over Decentralized Data Hanlin Tang, Xiangru Lian, Ming Yan, Ce Zhang, Ji Liu