ML Anthology
Lai, Viet Dac
2 publications
ICLR 2026
No Prompt Left Behind: Exploiting Zero-Variance Prompts in LLM Reinforcement Learning via Entropy-Guided Advantage Shaping
Thanh-Long V. Le, Myeongho Jeon, Kim Vu, Viet Dac Lai, Eunho Yang
NeurIPS 2025
Offline RL by Reward-Weighted Fine-Tuning for Conversation Optimization
Subhojyoti Mukherjee, Viet Dac Lai, Raghavendra Addanki, Ryan A. Rossi, Seunghyun Yoon, Trung Bui, Anup Rao, Jayakumar Subramanian, Branislav Kveton