Shang, Lifeng

29 publications

TMLR 2026 The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs Jierun Chen, Tiezheng Yu, Haoli Bai, Lewei Yao, Jiannan Wu, Kaican Li, Fei Mi, Chaofan Tao, Lei Zhu, Manyi Zhang, Xiao-Hui Li, Lu Hou, Lifeng Shang, Qun Liu
ICLR 2025 Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization Yuxin Jiang, Bo Huang, Yufei Wang, Xingshan Zeng, Liangyou Li, Yasheng Wang, Xin Jiang, Lifeng Shang, Ruiming Tang, Wei Wang
NeurIPS 2025 DeepDiver: Adaptive Web-Search Intensity Scaling via Reinforcement Learning Wenxuan Shi, Haochen Tan, Chuqiao Kuang, Xiaoguang Li, Hanting Chen, Xiaozhe Ren, Yasheng Wang, Lu Hou, Lifeng Shang
ICML 2025 Flat-LoRA: Low-Rank Adaptation over a Flat Loss Landscape Tao Li, Zhengbao He, Yujun Li, Yasheng Wang, Lifeng Shang, Xiaolin Huang
NeurIPS 2025 QFFT, Question-Free Fine-Tuning for Adaptive Reasoning Wanlong Liu, Junxiao Xu, Fei Yu, Yukang Lin, Ke Ji, Wenyu Chen, Lifeng Shang, Yasheng Wang, Yan Xu, Benyou Wang
ICLR 2025 RevisEval: Improving LLM-as-a-Judge via Response-Adapted References Qiyuan Zhang, Yufei Wang, Tiezheng Yu, Yuxin Jiang, Chuhan Wu, Liangyou Li, Yasheng Wang, Xin Jiang, Lifeng Shang, Ruiming Tang, Fuyuan Lyu, Chen Ma
NeurIPS 2025 RidgeLoRA: Matrix Ridge Enhanced Low-Rank Adaptation of Large Language Models Junda Zhu, Jun Ai, Yujun Li, Yichun Yin, Yasheng Wang, Lifeng Shang, Qun Liu
ICLR 2025 ToolACE: Winning the Points of LLM Function Calling Weiwen Liu, Xu Huang, Xingshan Zeng, Xinlong Hao, Shuai Yu, Dexun Li, Shuai Wang, Weinan Gan, Zhengying Liu, Yuanqing Yu, Zezhong Wang, Yuxian Wang, Wu Ning, Yutai Hou, Bin Wang, Chuhan Wu, Wang Xinzhi, Yong Liu, Yasheng Wang, Duyu Tang, Dandan Tu, Lifeng Shang, Xin Jiang, Ruiming Tang, Defu Lian, Qun Liu, Enhong Chen
NeurIPSW 2024 Flat-LoRA: Low-Rank Adaption over a Flat Loss Landscape Tao Li, Zhengbao He, Yujun Li, Yasheng Wang, Lifeng Shang, Xiaolin Huang
ICLR 2024 Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis Kai Chen, Chunwei Wang, Kuo Yang, Jianhua Han, Lanqing Hong, Fei Mi, Hang Xu, Zhengying Liu, Wenyong Huang, Zhenguo Li, Dit-Yan Yeung, Lifeng Shang
AAAI 2024 Preparing Lessons for Progressive Training on Language Models Yu Pan, Ye Yuan, Yichun Yin, Jiaxin Shi, Zenglin Xu, Ming Zhang, Lifeng Shang, Xin Jiang, Qun Liu
ICLR 2024 Retrieval-Based Disentangled Representation Learning with Natural Language Supervision Jiawei Zhou, Xiaoguang Li, Lifeng Shang, Xin Jiang, Qun Liu, Lei Chen
NeurIPS 2023 Reusing Pretrained Models by Multi-Linear Operators for Efficient Training Yu Pan, Ye Yuan, Yichun Yin, Zenglin Xu, Lifeng Shang, Xin Jiang, Qun Liu
AAAI 2023 Self-Supervised Logic Induction for Explainable Fuzzy Temporal Commonsense Reasoning Bibo Cai, Xiao Ding, Zhouhao Sun, Bing Qin, Ting Liu, Baojun Wang, Lifeng Shang
ICLR 2022 Exploring Extreme Parameter Compression for Pre-Trained Language Models Benyou Wang, Yuxin Ren, Lifeng Shang, Xin Jiang, Qun Liu
NeurIPS 2022 Towards Efficient Post-Training Quantization of Pre-Trained Language Models Haoli Bai, Lu Hou, Lifeng Shang, Xin Jiang, Irwin King, Michael R Lyu
AAAI 2021 HopRetriever: Retrieve Hops over Wikipedia to Answer Complex Questions Shaobo Li, Xiaoguang Li, Lifeng Shang, Xin Jiang, Qun Liu, Chengjie Sun, Zhenzhou Ji, Bingquan Liu
ICML 2021 Improved OOD Generalization via Adversarial Training and Pretraing Mingyang Yi, Lu Hou, Jiacheng Sun, Lifeng Shang, Xin Jiang, Qun Liu, Zhiming Ma
AAAI 2021 Noninvasive Self-Attention for Side Information Fusion in Sequential Recommendation Chang Liu, Xiaoguang Li, Guohao Cai, Zhenhua Dong, Hong Zhu, Lifeng Shang
ICLR 2021 On Position Embeddings in BERT Benyou Wang, Lifeng Shang, Christina Lioma, Xin Jiang, Hao Yang, Qun Liu, Jakob Grue Simonsen
ICLR 2021 Reweighting Augmented Samples by Minimizing the Maximal Expected Loss Mingyang Yi, Lu Hou, Lifeng Shang, Xin Jiang, Qun Liu, Zhi-Ming Ma
AAAI 2020 Dialog State Tracking with Reinforced Data Augmentation Yichun Yin, Lifeng Shang, Xin Jiang, Xiao Chen, Qun Liu
NeurIPS 2020 DynaBERT: Dynamic BERT with Adaptive Width and Depth Lu Hou, Zhiqi Huang, Lifeng Shang, Xin Jiang, Xiao Chen, Qun Liu
AAAI 2017 Neural Machine Translation with Reconstruction Zhaopeng Tu, Yang Liu, Lifeng Shang, Xiaohua Liu, Hang Li
IJCAI 2016 Neural Generative Question Answering Jun Yin, Xin Jiang, Zhengdong Lu, Lifeng Shang, Hang Li, Xiaoming Li
ICCV 2015 Multimodal Convolutional Neural Networks for Matching Image and Sentence Lin Ma, Zhengdong Lu, Lifeng Shang, Hang Li
ECCV 2012 Mode Seeking with an Adaptive Distance Measure Guodong Pan, Lifeng Shang, Dirk Schnieders, Kwan-Yee Kenneth Wong
ECCVW 2012 Mode Seeking with an Adaptive Distance Measure Guodong Pan, Lifeng Shang, Dirk Schnieders, Kwan-Yee Kenneth Wong
CVPR 2009 Nonparametric Discriminant HMM and Application to Facial Expression Recognition Lifeng Shang, Kwok-Ping Chan