Zeng, Zhiyuan

11 publications

NeurIPS 2025. Implicit Reward as the Bridge: A Unified View of SFT and DPO Connections. Bo Wang, Qinyuan Cheng, Runyu Peng, Rong Bao, Peiji Li, Qipeng Guo, Linyang Li, Zhiyuan Zeng, Yunhua Zhou, Xipeng Qiu
NeurIPS 2025. Precise Information Control in Long-Form Text Generation. Jacqueline He, Howard Yen, Margaret Li, Shuyue Stella Li, Zhiyuan Zeng, Weijia Shi, Yulia Tsvetkov, Danqi Chen, Pang Wei Koh, Luke Zettlemoyer
NeurIPS 2025. Reinforcement Learning for Reasoning in Large Language Models with One Training Example. Yiping Wang, Qing Yang, Zhiyuan Zeng, Liliang Ren, Liyuan Liu, Baolin Peng, Hao Cheng, Xuehai He, Kuan Wang, Jianfeng Gao, Weizhu Chen, Shuohang Wang, Simon Shaolei Du, Yelong Shen
ICLR 2024. Evaluating Large Language Models at Evaluating Instruction Following. Zhiyuan Zeng, Jiatong Yu, Tianyu Gao, Yu Meng, Tanya Goyal, Danqi Chen
ICML 2024. Exploring the Benefit of Activation Sparsity in Pre-Training. Zhengyan Zhang, Chaojun Xiao, Qiujieli Qin, Yankai Lin, Zhiyuan Zeng, Xu Han, Zhiyuan Liu, Ruobing Xie, Maosong Sun, Jie Zhou
ICLR 2024. Sheared Llama: Accelerating Language Model Pre-Training via Structured Pruning. Mengzhou Xia, Tianyu Gao, Zhiyuan Zeng, Danqi Chen
NeurIPSW 2023. Evaluating Large Language Models at Evaluating Instruction Following. Zhiyuan Zeng, Jiatong Yu, Tianyu Gao, Yu Meng, Tanya Goyal, Danqi Chen
ICLRW 2023. KNIFE: Distilling Meta-Reasoning Knowledge with Free-Text Rationales. Aaron Chan, Zhiyuan Zeng, Wyatt Lake, Brihi Joshi, Hanjie Chen, Xiang Ren
ICLR 2023. SCoMoE: Efficient Mixtures of Experts with Structured Communication. Zhiyuan Zeng, Deyi Xiong
NeurIPSW 2023. Sheared Llama: Accelerating Language Model Pre-Training via Structured Pruning. Mengzhou Xia, Tianyu Gao, Zhiyuan Zeng, Danqi Chen
IJCAI 2023. Unsupervised and Few-Shot Parsing from Pretrained Language Models (Extended Abstract). Zhiyuan Zeng, Deyi Xiong