Zhao, Kaixiang

1 publications

ICLR 2026 Provable and Practical In-Context Policy Optimization for Self-Improvement Tianrun Yu, Yuxiao Yang, Zhaoyang Wang, Kaixiang Zhao, Porter Jenkins, Xuchao Zhang, Chetan Bansal, Huaxiu Yao, Weitong Zhang