Peng, Andy

1 publications

ICLR 2025 Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data Zhiyuan Zhou, Andy Peng, Qiyang Li, Sergey Levine, Aviral Kumar