Zhang, Kongcheng

3 publications

NeurIPS 2025 Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning Kongcheng Zhang, Qi Yao, Shunyu Liu, Yingjie Wang, Baisheng Lai, Jieping Ye, Mingli Song, Dacheng Tao
IJCAI 2025 Odyssey : Empowering Minecraft Agents with Open-World Skills Shunyu Liu, Yaoru Li, Kongcheng Zhang, Zhenyu Cui, Wenkai Fang, Yuxuan Zheng, Tongya Zheng, Mingli Song
NeurIPS 2025 SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data Wenkai Fang, Shunyu Liu, Yang Zhou, Kongcheng Zhang, Tongya Zheng, Kaixuan Chen, Mingli Song, Dacheng Tao