ML Anthology
Lai, Viet Dac
1 publication
NeurIPS 2025
Offline RL by Reward-Weighted Fine-Tuning for Conversation Optimization
Subhojyoti Mukherjee, Viet Dac Lai, Raghavendra Addanki, Ryan A. Rossi, Seunghyun Yoon, Trung Bui, Anup Rao, Jayakumar Subramanian, Branislav Kveton