Han, Xiaoyu

1 publications

ICLR 2026 Optimal Aggregation of LLM and PRM Signals for Efficient Test-Time Scaling Peng Kuang, Yanli Wang, Xiaoyu Han, Yaowenqi Liu, Kaidi Xu, Haohan Wang