ML Anthology
Authors
Search
About
Lu, Hengtong
1 publications
AAAI
2025
Data with High and Consistent Preference Difference Are Better for Reward Model
Qi Lin
,
Hengtong Lu
,
Caixia Yuan
,
Xiaojie Wang
,
Huixing Jiang
,
Wei Chen