Han, Yuxuan
13 publications
ICLR
2026
DR-SAC: Distributionally Robust Soft Actor-Critic for Reinforcement Learning Under Uncertainty
NeurIPS
2025
Improved Confidence Regions and Optimal Algorithms for Online and Offline Linear MNL Bandits