Shi, Diyuan

1 publications

ICML 2023 Beyond Reward: Offline Preference-Guided Policy Optimization Yachen Kang, Diyuan Shi, Jinxin Liu, Li He, Donglin Wang