Chen, Yihui

1 publication

NeurIPS 2025. Progress Reward Model for Reinforcement Learning via Large Language Models. Xiuhui Zhang, Ning Gao, Xingyu Jiang, Yihui Chen, Yuheng Pan, Mohan Zhang, Yue Deng