Cheng, Qinyuan

4 publications

NeurIPS 2025. Implicit Reward as the Bridge: A Unified View of SFT and DPO Connections. Bo Wang, Qinyuan Cheng, Runyu Peng, Rong Bao, Peiji Li, Qipeng Guo, Linyang Li, Zhiyuan Zeng, Yunhua Zhou, Xipeng Qiu
ICLRW 2025. World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning. Siyin Wang, Zhaoye Fei, Qinyuan Cheng, Shiduo Zhang, Panpan Cai, Jinlan Fu, Xipeng Qiu
ICML 2024. Can AI Assistants Know What They Don’t Know? Qinyuan Cheng, Tianxiang Sun, Xiangyang Liu, Wenwei Zhang, Zhangyue Yin, Shimin Li, Linyang Li, Zhengfu He, Kai Chen, Xipeng Qiu
AAAI 2023. Mitigating Negative Style Transfer in Hybrid Dialogue System. Shimin Li, Qinyuan Cheng, Linyang Li, Xipeng Qiu