Tang, Huaze

4 publications

ICLR 2026 Policy Newton Algorithm in Reproducing Kernel Hilbert Space Yixian Zhang, Huaze Tang, Changxu Wei, Chao Wang, Wenbo Ding
ICLRW 2025 Inherent Exploration via Sampling for Stochastic Policies Zhenpeng Shi, Chi Xu, Huaze Tang, Wenbo Ding
ICLR 2025 Residual Kernel Policy Network: Enhancing Stability and Robustness in RKHS-Based Reinforcement Learning Yixian Zhang, Huaze Tang, Huijing Lin, Wenbo Ding
ICMLW 2024 Adaptive Two-Level Quasi-Monte Carlo for Soft Actor-Critic Du Ouyang, Zhenpeng Shi, Aodong Guo, Huaze Tang, Hejin Wang, Chao Wang, Wenbo Ding