Kwak, Min Gu

1 publication

ICLR 2026. Policy Likelihood-Based Query Sampling and Critic-Exploited Reset for Efficient Preference-Based Reinforcement Learning. Jongkook Heo, Jaehoon Kim, Young Jae Lee, Min Gu Kwak, Youngjoon Park, Seoung Bum Kim.