Liu, Liyuan

25 publications

NeurIPS 2025 Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation Liliang Ren, Congcong Chen, Haoran Xu, Young Jin Kim, Adam Atkinson, Zheng Zhan, Jiankai Sun, Baolin Peng, Liyuan Liu, Shuohang Wang, Hao Cheng, Jianfeng Gao, Weizhu Chen, Yelong Shen
NeurIPS 2025 Mixture of Inputs: Text Generation Beyond Discrete Token Sampling Yufan Zhuang, Liyuan Liu, Chandan Singh, Jingbo Shang, Jianfeng Gao
ICML 2025 On the Generalization Ability of Next-Token-Prediction Pretraining Zhihao Li, Xue Jiang, Liyuan Liu, Xuelin Zhang, Hong Chen, Feng Zheng
NeurIPS 2025 Reinforcement Learning for Reasoning in Large Language Models with One Training Example Yiping Wang, Qing Yang, Zhiyuan Zeng, Liliang Ren, Liyuan Liu, Baolin Peng, Hao Cheng, Xuehai He, Kuan Wang, Jianfeng Gao, Weizhu Chen, Shuohang Wang, Simon Shaolei Du, Yelong Shen
NeurIPS 2025 Routing Mamba: Scaling State Space Models with Mixture-of-Experts Projection Zheng Zhan, Liliang Ren, Shuohang Wang, Liyuan Liu, Yang Liu, Yeyun Gong, Yanzhi Wang, Yelong Shen
NeurIPS 2025 Training Language Models to Generate Quality Code with Program Analysis Feedback Feng Yao, Zilong Wang, Liyuan Liu, Junxia Cui, Li Zhong, Xiaohan Fu, Haohui Mai, Vish Krishnan, Jianfeng Gao, Jingbo Shang
IJCAI 2025 Trajectory-Dependent Generalization Bounds for Pairwise Learning with Φ-Mixing Samples Liyuan Liu, Hong Chen, Weifu Li, Tieliang Gong, Hao Deng, Yulong Wang
ICLR 2025 Vector-ICL: In-Context Learning with Continuous Vector Representations Yufan Zhuang, Chandan Singh, Liyuan Liu, Jingbo Shang, Jianfeng Gao
ICLR 2024 Fast-ELECTRA for Efficient Pre-Training Chengyu Dong, Liyuan Liu, Hao Cheng, Jingbo Shang, Jianfeng Gao, Xiaodong Liu
NeurIPSW 2024 LORC: Low-Rank Compression for LLMs KV Cache with a Progressive Compression Strategy Rongzhi Zhang, Kuan Wang, Liyuan Liu, Shuohang Wang, Hao Cheng, Chao Zhang, Yelong Shen
TMLR 2024 Learning a Decision Tree Algorithm with Transformers Yufan Zhuang, Liyuan Liu, Chandan Singh, Jingbo Shang, Jianfeng Gao
ICLR 2024 Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs Suyu Ge, Yunan Zhang, Liyuan Liu, Minjia Zhang, Jiawei Han, Jianfeng Gao
ICLR 2024 Tell Your Model Where to Attend: Post-Hoc Attention Steering for LLMs Qingru Zhang, Chandan Singh, Liyuan Liu, Xiaodong Liu, Bin Yu, Jianfeng Gao, Tuo Zhao
ICLR 2024 Toward Student-Oriented Teacher Network Training for Knowledge Distillation Chengyu Dong, Liyuan Liu, Jingbo Shang
NeurIPS 2023 Bridging Discrete and Backpropagation: Straight-Through and Beyond Liyuan Liu, Chengyu Dong, Xiaodong Liu, Bin Yu, Jianfeng Gao
NeurIPSW 2023 Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs Suyu Ge, Yunan Zhang, Liyuan Liu, Minjia Zhang, Jiawei Han, Jianfeng Gao
NeurIPSW 2023 Sparse Backpropagation for MoE Training Liyuan Liu, Jianfeng Gao, Weizhu Chen
NeurIPSW 2023 Tell Your Model Where to Attend: Post-Hoc Attention Steering for LLMs Qingru Zhang, Chandan Singh, Liyuan Liu, Xiaodong Liu, Bin Yu, Jianfeng Gao, Tuo Zhao
NeurIPSW 2023 Toward Student-Oriented Teacher Network Training for Knowledge Distillation Chengyu Dong, Liyuan Liu, Jingbo Shang
NeurIPS 2022 Label Noise in Adversarial Training: A Novel Perspective to Study Robust Overfitting Chengyu Dong, Liyuan Liu, Jingbo Shang
AAAI 2021 Empower Distantly Supervised Relation Extraction with Collaborative Adversarial Training Tao Chen, Haochen Shi, Liyuan Liu, Siliang Tang, Jian Shao, Zhigang Chen, Yueting Zhuang
ICLR 2020 On the Variance of the Adaptive Learning Rate and Beyond Liyuan Liu, Haoming Jiang, Pengcheng He, Weizhu Chen, Xiaodong Liu, Jianfeng Gao, Jiawei Han
ICML 2020 Towards Adaptive Residual Network Training: A Neural-ODE Perspective Chengyu Dong, Liyuan Liu, Zichao Li, Jingbo Shang
AAAI 2019 Cross-Relation Cross-Bag Attention for Distantly-Supervised Relation Extraction Yujin Yuan, Liyuan Liu, Siliang Tang, Zhongfei Zhang, Yueting Zhuang, Shiliang Pu, Fei Wu, Xiang Ren
AAAI 2018 Empower Sequence Labeling with Task-Aware Neural Language Model Liyuan Liu, Jingbo Shang, Xiang Ren, Frank Fangzheng Xu, Huan Gui, Jian Peng, Jiawei Han