Zhang, Yixian

3 publications

ICLR 2026. Policy Newton Algorithm in Reproducing Kernel Hilbert Space. Yixian Zhang, Huaze Tang, Changxu Wei, Chao Wang, Wenbo Ding
ICLR 2026. SAC Flow: Sample-Efficient Reinforcement Learning of Flow-Based Policies via Velocity-Reparameterized Sequential Modeling. Yixian Zhang, Shu'ang Yu, Tonghe Zhang, Mo Guang, Haojia Hui, Kaiwen Long, Yu Wang, Chao Yu, Wenbo Ding
ICLR 2025. Residual Kernel Policy Network: Enhancing Stability and Robustness in RKHS-Based Reinforcement Learning. Yixian Zhang, Huaze Tang, Huijing Lin, Wenbo Ding