Lu, Hengtong

1 publications

AAAI 2025 Data with High and Consistent Preference Difference Are Better for Reward Model Qi Lin, Hengtong Lu, Caixia Yuan, Xiaojie Wang, Huixing Jiang, Wei Chen