Son, Seongho

3 publications

ICLRW 2025 Game-Theoretic Regularized Self-Play Alignment of Large Language Models Xiaohang Tang, Sangwoong Yoon, Seongho Son, Huizhuo Yuan, Quanquan Gu, Ilija Bogunovic
ICML 2025 Right Now, Wrong Then: Non-Stationary Direct Preference Optimization Under Preference Drift Seongho Son, William Bankes, Sayak Ray Chowdhury, Brooks Paige, Ilija Bogunovic
NeurIPSW 2024 Group Robust Best-of-K Decoding of Language Models for Pluralistic Alignment Sangwoong Yoon, William Bankes, Seongho Son, Anja Petrovic, Shyam Sundhar Ramesh, Xiaohang Tang, Ilija Bogunovic