Peng, Xiyue

3 publications

NeurIPS 2025 Enhancing Safety in Reinforcement Learning with Human Feedback via Rectified Policy Optimization Xiyue Peng, Hengquan Guo, Jiawei Zhang, Dongqing Zou, Ziyu Shao, Honghao Wei, Xin Liu
NeurIPS 2024 Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning Honghao Wei, Xiyue Peng, Arnob Ghosh, Xin Liu
NeurIPS 2024 Safe and Efficient: A Primal-Dual Method for Offline Convex CMDPs Under Partial Data Coverage Haobo Zhang, Xiyue Peng, Honghao Wei, Xin Liu