Li, Xiaomin

11 publications

ICLR 2026. Bradley-Terry and Multi-Objective Reward Modeling Are Complementary. Zhiwei Zhang, Hui Liu, Xiaomin Li, Zhenwei Dai, Jingying Zeng, Fali Wang, Minhua Lin, Ramraj Chandradevan, Linlin Wu, Zhen Li, Chen Luo, Zongyu Wu, Xianfeng Tang, Qi He, Suhang Wang
ICLR 2026. Multiplayer Nash Preference Optimization. Fang Wu, Xu Huang, Weihao Xuan, Zhiwei Zhang, Yijia Xiao, Guancheng Wan, Xiaomin Li, Bing Hu, Peng Xia, Jure Leskovec, Yejin Choi
ICLR 2026. Unlocking the Power of Multi-Agent LLM for Reasoning: From Lazy Agents to Deliberation. Zhiwei Zhang, Xiaomin Li, Yudi Lin, Hui Liu, Ramraj Chandradevan, Linlin Wu, Minhua Lin, Fali Wang, Xianfeng Tang, Qi He, Suhang Wang
NeurIPS 2025. CARES: Comprehensive Evaluation of Safety and Adversarial Robustness in Medical LLMs. Sijia Chen, Xiaomin Li, Mengxue Zhang, Eric Hanchen Jiang, Qingcheng Zeng, Chen-Hsiang Yu
ICCV 2025. CCL-LGS: Contrastive Codebook Learning for 3D Language Gaussian Splatting. Lei Tian, Xiaomin Li, Liqian Ma, Hao Yin, Zirui Zheng, Hefei Huang, Taiqing Li, Huchuan Lu, Xu Jia
ICLR 2025. Catastrophic Failure of LLM Unlearning via Quantization. Zhiwei Zhang, Fali Wang, Xiaomin Li, Zongyu Wu, Xianfeng Tang, Hui Liu, Qi He, Wenpeng Yin, Suhang Wang
ICLRW 2025. Data-Adaptive Safety Rules for Training Reward Models. Xiaomin Li, Mingye Gao, Zhiwei Zhang, Jingxuan Fan, Weiyu Li
CVPR 2025. ReNeg: Learning Negative Embedding with Reward Guidance. Xiaomin Li, Yixuan Liu, Takashi Isobe, Xu Jia, Qinpeng Cui, Dong Zhou, Dong Li, You He, Huchuan Lu, Zhongdao Wang, Emad Barsoum
ICLRW 2025. Rule-Based Rating and Selection of LLM Training Data. Xiaomin Li, Mingye Gao, Zhiwei Zhang, Chang Yue, Hong Hu
ICML 2025. RuleAdapter: Dynamic Rules for Training Safety Reward Models in RLHF. Xiaomin Li, Mingye Gao, Zhiwei Zhang, Jingxuan Fan, Weiyu Li
NeurIPS 2025. When Thinking Fails: The Pitfalls of Reasoning for Instruction-Following in LLMs. Xiaomin Li, Zhou Yu, Zhiwei Zhang, Xupeng Chen, Ziji Zhang, Yingying Zhuang, Narayanan Sadagopan, Anurag Beniwal