Chen, Guanxu

1 publications

ICLR 2026 Conditional Advantage Estimation for Reinforcement Learning in Large Reasoning Models Guanxu Chen, Yafu Li, Yuxian Jiang, Chen Qian, Qihan Ren, Yang JingYi, Yu Cheng, Dongrui Liu, Jing Shao