ML Anthology
Authors
Search
About
Tian, Xiangyu
1 publications
NeurIPS
2025
Learning When to Think: Shaping Adaptive Reasoning in R1-Style Models via Multi-Stage RL
Songjun Tu
,
Jiahao Lin
,
Qichao Zhang
,
Xiangyu Tian
,
Linjing Li
,
Xiangyuan Lan
,
Dongbin Zhao