Ding, Ruomeng

1 publications

NeurIPS 2024 Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs Rui Yang, Ruomeng Ding, Yong Lin, Huan Zhang, Tong Zhang