Cui, Sijia

1 publication

NeurIPS 2025. STAR: Efficient Preference-Based Reinforcement Learning via Dual Regularization. Fengshuo Bai, Rui Zhao, Hongming Zhang, Sijia Cui, Shao Zhang, Bo Xu, Lei Han, Ying Wen, Yaodong Yang