ML Anthology
Authors
Search
About
Li, Peiji
2 publications
NeurIPS
2025
Implicit Reward as the Bridge: A Unified View of SFT and DPO Connections
Bo Wang
,
Qinyuan Cheng
,
Runyu Peng
,
Rong Bao
,
Peiji Li
,
Qipeng Guo
,
Linyang Li
,
Zhiyuan Zeng
,
Yunhua Zhou
,
Xipeng Qiu
NeurIPS
2025
Mixing Expert Knowledge: Bring Human Thoughts Back to the Game of Go
Yichuan Ma
,
Linyang Li
,
Yongkang Chen
,
Peiji Li
,
Jiasheng Ye
,
Qipeng Guo
,
Dahua Lin
,
Kai Chen