Jiang, Xiaojian

1 publication

ICLR 2026. Fixing the Broken Compass: Diagnosing and Improving Inference-Time Reward Modeling. Jiachun Li, Pengfei Cao, Zhuoran Jin, Yubo Chen, Jiexin Xu, Huaijun Li, Xiaojian Jiang, Kang Liu, Jun Zhao