Lodkaew, Thanawat

2 publications

TMLR 2026 On Symmetric Losses for Policy Optimization with Noisy Preferences Soichiro Nishimori, Yu-Jie Zhang, Thanawat Lodkaew, Masashi Sugiyama
TMLR 2025 Importance Weighting for Aligning Language Models Under Deployment Distribution Shift Thanawat Lodkaew, Tongtong Fang, Takashi Ishida, Masashi Sugiyama