ML Anthology
Lodkaew, Thanawat
2 publications
TMLR 2026
On Symmetric Losses for Policy Optimization with Noisy Preferences
Soichiro Nishimori, Yu-Jie Zhang, Thanawat Lodkaew, Masashi Sugiyama
TMLR 2025
Importance Weighting for Aligning Language Models Under Deployment Distribution Shift
Thanawat Lodkaew, Tongtong Fang, Takashi Ishida, Masashi Sugiyama