Ju, Seokhun

3 publications

ICML 2025. Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation. Taehyun Cho, Seungyub Han, Seokhun Ju, Dohyeong Kim, Kyungjae Lee, Jungwoo Lee
NeurIPS 2025. Pareto Optimal Risk-Agnostic Distributional Bandits with Heavy-Tail Rewards. Kyungjae Lee, Dohyeong Kim, Taehyun Cho, Chaeyeon Kim, Yunkyung Ko, Seungyub Han, Seokhun Ju, Dohyeok Lee, Sungbin Lim
ICML 2025. Policy-Labeled Preference Learning: Is Preference Enough for RLHF? Taehyun Cho, Seokhun Ju, Seungyub Han, Dohyeong Kim, Kyungjae Lee, Jungwoo Lee