Tian, Hongtao

2 publications

ICLR 2026 Learning More with Less: A Dynamic Dual-Level Down-Sampling Framework for Efficient Policy Optimization Chao Wang, Tao Yang, Hongtao Tian, Yunsheng Shi, Qiyao Ma, XiaotaoLiu, Ting Yao, Wenbo Ding
ICML 2025 Discriminative Policy Optimization for Token-Level Reward Models Hongzhan Chen, Tao Yang, Shiping Gao, Ruijun Chen, Xiaojun Quan, Hongtao Tian, Ting Yao