Bai, Clive

1 publication

ICLR 2026. Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners. Xin Xu, Clive Bai, Kai Yang, Tianhao Chen, Yang Wang, Saiyong Yang, Can Yang