Song, Hosung

1 publications

NeurIPS 2025 KL Penalty Control via Perturbation for Direct Preference Optimization Sangkyu Lee, Janghoon Han, Hosung Song, Stanley Jungkyu Choi, Honglak Lee, Youngjae Yu