Zhao, Yanxiao

2 publications

ICLR 2026. ComputerRL: Scaling End-to-End Online Reinforcement Learning for Computer Use Agents. Hanyu Lai, Xiao Liu, Yanxiao Zhao, Han Xu, Hanchen Zhang, Bohao Jing, Yanyu Ren, Shuntian Yao, Yuxiao Dong, Jie Tang
ICMLW 2024. Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency. Yanxiao Zhao, Yangge Qian, Tianyi Wang, Jingyang Shan, Xiaolin Qin