Peng, Andy

2 publications

ICLR 2026 Self-Improving Vision-Language-Action Models with Data Generation via Residual RL Wenli Xiao, Haotian Lin, Andy Peng, Haoru Xue, Tairan He, Zhengyi Luo, Yuqi Xie, Fengyuan Hu, Linxi Fan, Guanya Shi, Yuke Zhu
ICLR 2025 Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data Zhiyuan Zhou, Andy Peng, Qiyang Li, Sergey Levine, Aviral Kumar