Gao, Shiqing

3 publications

ICML 2025 Controlling Underestimation Bias in Constrained Reinforcement Learning for Safe Exploration Shiqing Gao, Jiaxin Ding, Luoyi Fu, Xinbing Wang
ICML 2025 Extreme Value Policy Optimization for Safe Reinforcement Learning Shiqing Gao, Yihang Zhou, Shuai Shao, Haoyu Luo, Yiheng Bing, Jiaxin Ding, Luoyi Fu, Xinbing Wang
IJCAI 2024 Exterior Penalty Policy Optimization with Penalty Metric Network Under Constraints Shiqing Gao, Jiaxin Ding, Luoyi Fu, Xinbing Wang, Chenghu Zhou