Xu, Yinglun
4 publications
TMLR
2025
Two-Step Offline Preference-Based Reinforcement Learning on Explicitly Constrained Policies
Yinglun Xu, Tarun Suresh, Rohan Gumaste, David Zhu, Ruirui Li, Zhengyang Wang, Haoming Jiang, Xianfeng Tang, Qingyu Yin, Monica Xiao Cheng, Qi Zeng, Chao Zhang, Gagandeep Singh