Tian, Hongtao

1 publications

ICML 2025 Discriminative Policy Optimization for Token-Level Reward Models Hongzhan Chen, Tao Yang, Shiping Gao, Ruijun Chen, Xiaojun Quan, Hongtao Tian, Ting Yao