Song, Huizhong

3 publications

NeurIPS 2025. Risk-Aware Direct Preference Optimization Under Nested Risk Measure. Lijun Zhang, Lin Li, Yajie Qi, Huizhong Song, Yaodong Yang, Jun Wang, Wei Wei
NeurIPS 2024. Scalable Constrained Policy Optimization for Safe Multi-Agent Reinforcement Learning. Lijun Zhang, Lin Li, Wei Wei, Huizhong Song, Yaodong Yang, Jiye Liang
ICML 2023. Set-Membership Belief State-Based Reinforcement Learning for POMDPs. Wei Wei, Lijun Zhang, Lin Li, Huizhong Song, Jiye Liang