ML Anthology
Authors
Search
About
Li, Huaijun
1 publications
ICLR
2026
Fixing the Broken Compass: Diagnosing and Improving Inference-Time Reward Modeling
Jiachun Li
,
Pengfei Cao
,
Zhuoran Jin
,
Yubo Chen
,
Jiexin Xu
,
Huaijun Li
,
Xiaojian Jiang
,
Kang Liu
,
Jun Zhao