He, Xuming

81 publications

TMLR 2025 DA-DPO: Cost-Efficient Difficulty-Aware Preference Optimization for Reducing MLLM Hallucinations Longtian Qiu, Shan Ning, Chuyu Zhang, Jiaxuan Sun, Xuming He
NeurIPS 2025 GUI-Rise: Structured Reasoning and History Summarization for GUI Navigation Tao Liu, Chongyu Wang, Rongjie Li, Yingchen Yu, Xuming He, Song Bai
ICCV 2025 GeoDistill: Geometry-Guided Self-Distillation for Weakly Supervised Cross-View Localization Shaowen Tong, Zimin Xia, Alexandre Alahi, Xuming He, Yujiao Shi
NeurIPS 2025 LithoSim: A Large, Holistic Lithography Simulation Benchmark for AI-Driven Semiconductor Manufacturing Hongquan He, Zhen Wang, Jingya Wang, Tao Wu, Xuming He, Bei Yu, Jingyi Yu, Hao Geng
NeurIPS 2025 NoisyGRPO: Incentivizing Multimodal CoT Reasoning via Noise Injection and Bayesian Estimation Longtian Qiu, Shan Ning, Jiaxuan Sun, Xuming He
NeurIPS 2025 RadarQA: Multi-Modal Quality Analysis of Weather Radar Forecasts Xuming He, Zhiyuan You, Junchao Gong, Couhua Liu, Xiaoyu Yue, Peiqin Zhuang, Wenlong Zhang, Lei Bai
AAAI 2025 Relation-Aware Hierarchical Prompt for Open-Vocabulary Scene Graph Generation Tao Liu, Rongjie Li, Chongyu Wang, Xuming He
NeurIPS 2025 Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning Yuhao Zhou, Yiheng Wang, Xuming He, Ruoyao Xiao, Zhiwei Li, Qiantai Feng, Zijie Guo, Yuejin Yang, Hao Wu, Wenxuan Huang, Jiaqi Wei, Dan Si, Yao Xiuqi, Jia Bu, Haiwen Huang, Tianfan Fu, Shixiang Tang, Ben Fei, Dongzhan Zhou, Fenghua Ling, Yan Lu, Siqi Sun, Chenhui Li, Guanjie Zheng, Jiancheng Lv, Wenlong Zhang, Lei Bai
NeurIPS 2025 TokMan: Tokenize Manhattan Mask Optimization for Inverse Lithography Yiwen Wu, Yuyang Chen, Ye Xia, Yao Zhao, Jingya Wang, Xuming He, Hao Geng, Jingyi Yu
NeurIPS 2024 CryoGEM: Physics-Informed Generative Cryo-Electron Microscopy Jiakai Zhang, Qihe Chen, Yan Zeng, Wenyuan Gao, Xuming He, Zhijie Liu, Jingyi Yu
CVPR 2024 DSGG: Dense Relation Transformer for an End-to-End Scene Graph Generation Zeeshan Hayder, Xuming He
ECCV 2024 Dual-Level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation Ruijie Xu, Chuyu Zhang, Hui Ren, Xuming He
CVPR 2024 From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models Rongjie Li, Songyang Zhang, Dahua Lin, Kai Chen, Xuming He
NeurIPS 2024 Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts Zhitong Gao, Bingnan Li, Mathieu Salzmann, Xuming He
CVPR 2024 Learning by Correction: Efficient Tuning Task for Zero-Shot Generative Vision-Language Reasoning Rongjie Li, Yu Wu, Xuming He
AAAI 2024 Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training Longtian Qiu, Shan Ning, Xuming He
ICLR 2024 P$^2$OT: Progressive Partial Optimal Transport for Deep Imbalanced Clustering Chuyu Zhang, Hui Ren, Xuming He
IJCAI 2024 RealDex: Towards Human-like Grasping for Robotic Dexterous Hand Yumeng Liu, Yaxun Yang, Youzhuo Wang, Xiaofei Wu, Jiamin Wang, Yichen Yao, Sören Schwertfeger, Sibei Yang, Wenping Wang, Jingyi Yu, Xuming He, Yuexin Ma
ECCV 2024 SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-Modal Large Language Models Ziyi Lin, Dongyang Liu, Renrui Zhang, Peng Gao, Longtian Qiu, Han Xiao, Han Qiu, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Yu Qiao, Hongsheng Li
NeurIPS 2023 ATTA: Anomaly-Aware Test-Time Adaptation for Out-of-Distribution Detection in Segmentation Zhitong Gao, Shipeng Yan, Xuming He
AAAI 2023 CALIP: Zero-Shot Enhancement of CLIP with Parameter-Free Attention Ziyu Guo, Renrui Zhang, Longtian Qiu, Xianzheng Ma, Xupeng Miao, Xuming He, Bin Cui
ICCV 2023 Class-Relation Knowledge Distillation for Novel Class Discovery Peiyan Gu, Chuyu Zhang, Ruijie Xu, Xuming He
ICCV 2023 Grounded Image Text Matching with Mismatched Relation Reasoning Yu Wu, Yana Wei, Haozhe Wang, Yongfei Liu, Sibei Yang, Xuming He
CVPR 2023 HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models Shan Ning, Longtian Qiu, Yongfei Liu, Xuming He
ICCV 2023 Human-Centric Scene Understanding for 3D Large-Scale Scenarios Yiteng Xu, Peishan Cong, Yichen Yao, Runnan Chen, Yuenan Hou, Xinge Zhu, Xuming He, Jingyi Yu, Yuexin Ma
IJCAI 2023 MILD: Modeling the Instance Learning Dynamics for Learning with Noisy Labels Chuanyang Hu, Shipeng Yan, Zhitong Gao, Xuming He
ICLR 2023 Modeling Multimodal Aleatoric Uncertainty in Segmentation with Mixture of Stochastic Experts Zhitong Gao, Yucong Chen, Chuyu Zhang, Xuming He
TMLR 2023 Novel Class Discovery for Long-Tailed Recognition Chuyu Zhang, Ruijie Xu, Xuming He
ICCVW 2023 The Robust Semantic Segmentation UNCV2023 Challenge Results Xuanlong Yu, Yi Zuo, Zitao Wang, Xiaowen Zhang, Jiaxuan Zhao, Yuting Yang, Licheng Jiao, Rui Peng, Xinyi Wang, Junpei Zhang, Kexin Zhang, Fang Liu, Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo, Hanlin Tian, Kenta Matsui, Tianhao Wang, Fahmy Adan, Zhitong Gao, Xuming He, Quentin Bouniot, Hossein Moghaddam, Shyam Nandan Rai, Fabio Cermelli, Carlo Masone, Andrea Pilzer, Elisa Ricci, Andrei Bursuc, Arno Solin, Martin Trapp, Rui Li, Angela Yao, Wenlong Chen, Ivor Simpson, Neill D. F. Campbell, Gianni Franchi
ICLR 2023 Weakly-Supervised HOI Detection via Prior-Guided Bi-Level Representation Learning Bo Wan, Yongfei Liu, Desen Zhou, Tinne Tuytelaars, Xuming He
CVPR 2022 General Incremental Learning with Domain-Aware Categorical Representations Jiangwei Xie, Shipeng Yan, Xuming He
ECCV 2022 Generative Negative Text Replay for Continual Vision-Language Pretraining Shipeng Yan, Lanqing Hong, Hang Xu, Jianhua Han, Tinne Tuytelaars, Zhenguo Li, Xuming He
ECCV 2022 Learning Semantic Correspondence with Sparse Annotations Shuaiyi Huang, Luyu Yang, Bo He, Songyang Zhang, Xuming He, Abhinav Shrivastava
CVPR 2022 SGTR: End-to-End Scene Graph Generation with Transformer Rongjie Li, Songyang Zhang, Xuming He
CVPR 2021 Bipartite Graph Network with Adaptive Message Passing for Unbiased Scene Graph Generation Rongjie Li, Songyang Zhang, Bo Wan, Xuming He
CVPR 2021 DER: Dynamically Expandable Representation for Class Incremental Learning Shipeng Yan, Jiangwei Xie, Xuming He
CVPR 2021 Distribution Alignment: A Unified Framework for Long-Tail Visual Recognition Songyang Zhang, Zeming Li, Shipeng Yan, Xuming He, Jian Sun
NeurIPS 2021 Dynamic Grained Encoder for Vision Transformers Lin Song, Songyang Zhang, Songtao Liu, Zeming Li, Xuming He, Hongbin Sun, Jian Sun, Nanning Zheng
ICCV 2021 GNeRF: GAN-Based Neural Radiance Field Without Posed Camera Quan Meng, Anpei Chen, Haimin Luo, Minye Wu, Hao Su, Lan Xu, Xuming He, Jingyi Yu
IJCAI 2021 Learning Implicit Temporal Alignment for Few-Shot Video Classification Songyang Zhang, Jiale Zhou, Xuming He
CVPR 2021 Relation-Aware Instance Refinement for Weakly Supervised Visual Grounding Yongfei Liu, Bo Wan, Lin Ma, Xuming He
AAAI 2020 Learning Cross-Modal Context Graph for Visual Grounding Yongfei Liu, Bo Wan, Xiaodan Zhu, Xuming He
ECCV 2020 Part-Aware Prototype Network for Few-Shot Semantic Segmentation Yongfei Liu, Xiangyi Zhang, Songyang Zhang, Xuming He
AAAI 2019 A Dual Attention Network with Semantic Embedding for Few-Shot Learning Shipeng Yan, Songyang Zhang, Xuming He
CVPRW 2019 A Dual Attention Network with Semantic Embedding for Few-Shot Learning Shipeng Yan, Songyang Zhang, Xuming He
ICML 2019 LatentGNN: Learning Efficient Non-Local Relations for Visual Recognition Songyang Zhang, Xuming He, Shipeng Yan
AAAI 2018 3D Box Proposals from a Single Monocular Image of an Indoor Scene Wei Zhuo, Mathieu Salzmann, Xuming He, Miaomiao Liu
WACV 2018 Instance-Aware Detailed Action Labeling in Videos Hongtao Yang, Xuming He, Fatih Porikli
CVPR 2017 Boundary-Aware Instance Segmentation Zeeshan Hayder, Xuming He, Mathieu Salzmann
ICCV 2017 Deep Free-Form Deformation Network for Object-Mask Registration Haoyang Zhang, Xuming He
CVPRW 2017 Efficient Scene Layout Aware Object Detection for Traffic Surveillance Tao Wang, Xuming He, Songzhi Su, Yin Guan
CVPR 2017 Indoor Scene Parsing with Instance Segmentation, Semantic Labeling and Support Relationship Inference Wei Zhuo, Mathieu Salzmann, Xuming He, Miaomiao Liu
IJCAI 2017 Learning Deep Structured Network for Weakly Supervised Change Detection Salman Hameed Khan, Xuming He, Fatih Porikli, Mohammed Bennamoun, Ferdous Ahmed Sohel, Roberto Togneri
WACV 2017 Learning Spatial Transforms for Refining Object Segment Proposals Haoyang Zhang, Xuming He, Fatih Porikli
CVPR 2017 Predicting Salient Face in Multiple-Face Videos Yufan Liu, Songyang Zhang, Mai Xu, Xuming He
ECCV 2016 Building Scene Models by Completing and Hallucinating Depth and Semantics Miaomiao Liu, Xuming He, Mathieu Salzmann
ECCV 2016 Learning Dynamic Hierarchical Models for Anytime Scene Labeling Buyu Liu, Xuming He
CVPR 2016 Learning to Co-Generate Object Proposals with a Deep Structured Network Zeeshan Hayder, Xuming He, Mathieu Salzmann
AAAI 2016 SentiCap: Generating Image Descriptions with Sentiments Alexander Patrick Mathews, Lexing Xie, Xuming He
WACV 2015 Choosing Basic-Level Concept Names Using Visual and Language Context Alexander Patrick Mathews, Lexing Xie, Xuming He
CVPR 2015 Indoor Scene Structure Analysis for Single Image Depth Estimation Wei Zhuo, Mathieu Salzmann, Xuming He, Miaomiao Liu
WACV 2015 Motion Segmentation of Truncated Signed Distance Function Based Volumetric Surfaces Samunda Perera, Nick Barnes, Xuming He, Shahram Izadi, Pushmeet Kohli, Ben Glocker
WACV 2015 Multi-Class Semantic Video Segmentation with Exemplar-Based Object Reasoning Buyu Liu, Xuming He, Stephen Gould
CVPR 2015 Multiclass Semantic Video Segmentation with Object-Level Active Inference Buyu Liu, Xuming He
CVPR 2015 Separating Objects and Clutter in Indoor Scenes Salman H. Khan, Xuming He, Mohammed Bennamoun, Ferdous Sohel, Roberto Togneri
ICCV 2015 Structural Kernel Learning for Large Scale Multiclass Object Co-Detection Zeeshan Hayder, Xuming He, Mathieu Salzmann
CVPR 2014 An Exemplar-Based CRF for Multi-Instance Object Segmentation Xuming He, Stephen Gould
CVPR 2014 Discrete-Continuous Depth Estimation from a Single Image Miaomiao Liu, Mathieu Salzmann, Xuming He
WACV 2014 Joint Semantic and Geometric Segmentation of Videos with a Stage Model Buyu Liu, Xuming He, Stephen Gould
ECCV 2014 Object Co-Detection via Efficient Inference in a Fully-Connected CRF Zeeshan Hayder, Mathieu Salzmann, Xuming He
ECCV 2014 Superpixel Graph Label Transfer with Learned Distance Metric Stephen Gould, Jiecheng Zhao, Xuming He, Yuhang Zhang
CVPR 2013 Learning Structured Hough Voting for Joint Object Detection and Occlusion Reasoning Tao Wang, Xuming He, Nick Barnes
ICCVW 2013 Multi-Instance Object Segmentation with Exemplars Xuming He, Stephen Gould
CVPR 2013 Winding Number for Region-Boundary Consistent Salient Contour Extraction Yansheng Ming, Hongdong Li, Xuming He
CVPR 2012 Connected Contours: A New Contour Completion Model That Respects the Closure Effect Yansheng Ming, Hongdong Li, Xuming He
NeurIPS 2010 A Unified Model of Short-Range and Long-Range Motion Perception Shuang Wu, Xuming He, Hongjing Lu, Alan L. Yuille
ECCV 2010 Occlusion Boundary Detection Using Pseudo-Depth Xuming He, Alan L. Yuille
CVPR 2008 Latent Topic Random Fields: Learning Using a Taxonomy of Labels Xuming He, Richard S. Zemel
NeurIPS 2008 Learning Hybrid Models for Image Annotation with Partially Labeled Data Xuming He, Richard S. Zemel
ECCV 2006 Learning and Incorporating Top-Down Cues in Image Segmentation Xuming He, Richard S. Zemel, Debajyoti Ray
CVPR 2004 Multiscale Conditional Random Fields for Image Labeling Xuming He, Richard S. Zemel, Miguel Á. Carreira-Perpiñán