Zhao, Xuanle

2 publications

ICLR 2026 Breaking the SFT Plateau: Multimodal Structured Reinforcement Learning for Chart-to-Code Generation Lei Chen, Xuanle Zhao, Zhixiong Zeng, Jing Huang, Liming Zheng, Yufeng Zhong, Lin Ma
NeurIPS 2023 ODE-Based Recurrent Model-Free Reinforcement Learning for POMDPs Xuanle Zhao, Duzhen Zhang, Han Liyuan, Tielin Zhang, Bo Xu