Li, Peiji

2 publications

NeurIPS 2025 Implicit Reward as the Bridge: A Unified View of SFT and DPO Connections Bo Wang, Qinyuan Cheng, Runyu Peng, Rong Bao, Peiji Li, Qipeng Guo, Linyang Li, Zhiyuan Zeng, Yunhua Zhou, Xipeng Qiu
NeurIPS 2025 Mixing Expert Knowledge: Bring Human Thoughts Back to the Game of Go Yichuan Ma, Linyang Li, Yongkang Chen, Peiji Li, Jiasheng Ye, Qipeng Guo, Dahua Lin, Kai Chen