Wei, Wenya

3 publications

NeurIPS 2025 Improving Reward Models with Proximal Policy Exploration for Preference-Based Reinforcement Learning Yiwen Zhu, Jinyi Liu, Pengjie Gu, Yifu Yuan, Zhenxing Ge, Wenya Wei, Zhou Fang, Yujing Hu, Bo An
NeurIPSW 2024 Optimizing Reward Models with Proximal Policy Exploration in Preference-Based Reinforcement Learning Yiwen Zhu, Jinyi Liu, Yifu Yuan, Wenya Wei, Zhenxing Ge, Qianyi Fu, Zhou Fang, Yujing Hu, Bo An
IJCAI 2024 vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement Yiwen Zhu, Jinyi Liu, Wenya Wei, Qianyi Fu, Yujing Hu, Zhou Fang, Bo An, Jianye Hao, Tangjie Lv, Changjie Fan