Fu, Qianyi

2 publications

NeurIPSW 2024 Optimizing Reward Models with Proximal Policy Exploration in Preference-Based Reinforcement Learning Yiwen Zhu, Jinyi Liu, Yifu Yuan, Wenya Wei, Zhenxing Ge, Qianyi Fu, Zhou Fang, Yujing Hu, Bo An
IJCAI 2024 vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement Yiwen Zhu, Jinyi Liu, Wenya Wei, Qianyi Fu, Yujing Hu, Zhou Fang, Bo An, Jianye Hao, Tangjie Lv, Changjie Fan