ML Anthology
Bai, Clive
1 publication
ICLR 2026
Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners
Xin Xu, Clive Bai, Kai Yang, Tianhao Chen, Yang Wang, Saiyong Yang, Can Yang