Peng, Xiyue

4 publications

ICLR 2026 Keep the Best, Forget the REST: Reliable Alignment with Order-Aware Preference Optimization Jiahui Zhu, Yuanjie Shi, Xiyue Peng, Xin Liu, Yan Yan, Honghao Wei
NeurIPS 2025 Enhancing Safety in Reinforcement Learning with Human Feedback via Rectified Policy Optimization Xiyue Peng, Hengquan Guo, Jiawei Zhang, Dongqing Zou, Ziyu Shao, Honghao Wei, Xin Liu
NeurIPS 2024 Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning Honghao Wei, Xiyue Peng, Arnob Ghosh, Xin Liu
NeurIPS 2024 Safe and Efficient: A Primal-Dual Method for Offline Convex CMDPs Under Partial Data Coverage Haobo Zhang, Xiyue Peng, Honghao Wei, Xin Liu