Kim, Chan Woo

1 publications

NeurIPS 2023 Sequential Preference Ranking for Efficient Reinforcement Learning from Human Feedback Minyoung Hwang, Gunmin Lee, Hogun Kee, Chan Woo Kim, Kyungjae Lee, Songhwai Oh