Luo, Zhi-Quan
38 publications
JMLR
2024
Bridging Distributional and Risk-Sensitive Reinforcement Learning with Provable Regret Bounds
NeurIPSW
2024
Entropic Distribution Matching for Supervised Fine-Tuning of LLMs: Less Overfitting and Better Diversity
ICMLW
2024
GPT-HyperAgent: Scalable Uncertainty Estimation and Exploration for Foundation Model Decisions
ICMLW
2023
Improving Adversarial Training for Multiple Perturbations Through the Lens of Uniform Stability
ICMLW
2023
Regret Bounds for Risk-Sensitive Reinforcement Learning with Lipschitz Dynamic Risk Measures