Bai, Fengshuo

8 publications

ICLR 2025 Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs Zhaowei Zhang, Fengshuo Bai, Qizhi Chen, Chengdong Ma, Mingzhi Wang, Haoran Sun, Zilong Zheng, Yaodong Yang
NeurIPS 2025 DexFlyWheel: A Scalable and Self-Improving Data Generation Framework for Dexterous Manipulation Kefei Zhu, Fengshuo Bai, YuanHao Xiang, Yishuai Cai, Xinglin Chen, Ruochong Li, Xingtao Wang, Hao Dong, Yaodong Yang, Xiaopeng Fan, Yuanpei Chen
AAAI 2025 RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors Fengshuo Bai, Runze Liu, Yali Du, Ying Wen, Yaodong Yang
NeurIPS 2025 STAR: Efficient Preference-Based Reinforcement Learning via Dual Regularization Fengshuo Bai, Rui Zhao, Hongming Zhang, Sijia Cui, Shao Zhang, Bo Xu, Lei Han, Ying Wen, Yaodong Yang
ICML 2024 PEARL: Zero-Shot Cross-Task Preference Alignment and Robust Reward Learning for Robotic Manipulation Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li
AAAI 2023 PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction Fengshuo Bai, Hongming Zhang, Tianyang Tao, Zhiheng Wu, Yanna Wang, Bo Xu
NeurIPSW 2023 Zero-Shot Cross-Task Preference Alignment for Offline RL via Optimal Transport Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li
NeurIPS 2022 Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-Based Reinforcement Learning Runze Liu, Fengshuo Bai, Yali Du, Yaodong Yang