ML Anthology
Authors
Search
About
Zhang, Gufeng
1 publications
ICLR
2026
Supervised Reinforcement Learning: From Expert Trajectories to Step-Wise Reasoning
Yihe Deng
,
I-Hung Hsu
,
Jun Yan
,
Zifeng Wang
,
Rujun Han
,
Gufeng Zhang
,
Yanfei Chen
,
Wei Wang
,
Tomas Pfister
,
Chen-Yu Lee