Zhang, Gufeng

1 publication

ICLR 2026. Supervised Reinforcement Learning: From Expert Trajectories to Step-Wise Reasoning. Yihe Deng, I-Hung Hsu, Jun Yan, Zifeng Wang, Rujun Han, Gufeng Zhang, Yanfei Chen, Wei Wang, Tomas Pfister, Chen-Yu Lee