ML Anthology
Ma, Yiran
1 publication
AAAI 2025
What Are Step-Level Reward Models Rewarding? Counterintuitive Findings from MCTS-Boosted Mathematical Reasoning
Yiran Ma, Zui Chen, Tianqiao Liu, Mi Tian, Zhuo Liu, Zitao Liu, Weiqi Luo