Cao, Xingchen

3 publications

ICML 2025. Improving Reward Model Generalization from Adversarial Process Enhanced Preferences. Zhilong Zhang, Tian Xu, Xinghao Du, Xingchen Cao, Yihao Sun, Yang Yu
ICML 2024. Limited Preference Aided Imitation Learning from Imperfect Demonstrations. Xingchen Cao, Fan-Ming Luo, Junyin Ye, Tian Xu, Zhilong Zhang, Yang Yu
ICLR 2024. Reward-Consistent Dynamics Models Are Strongly Generalizable for Offline Reinforcement Learning. Fan-Ming Luo, Tian Xu, Xingchen Cao, Yang Yu