Zuo, Xingdong

1 publications

NeurIPS 2023 Direct Preference-Based Policy Optimization Without Reward Modeling Gaon An, Junhyeok Lee, Xingdong Zuo, Norio Kosaka, Kyung-Min Kim, Hyun Oh Song