Lai, Viet Dac

1 publications

NeurIPS 2025 Offline RL by Reward-Weighted Fine-Tuning for Conversation Optimization Subhojyoti Mukherjee, Viet Dac Lai, Raghavendra Addanki, Ryan A. Rossi, Seunghyun Yoon, Trung Bui, Anup Rao, Jayakumar Subramanian, Branislav Kveton