Deik, Derrick Goh Xin

1 publications

ICMLW 2024 Aligning Crowd Feedback via Distributional Preference Reward Modeling Dexun Li, Cong Zhang, Kuicai Dong, Derrick Goh Xin Deik, Ruiming Tang, Yong Liu