Yu, Huizhen

6 publications

JMLR 2018 On Generalized Bellman Equations and Temporal-Difference Learning Huizhen Yu, A. Rupam Mahmood, Richard S. Sutton
JMLR 2016 Weak Convergence Properties of Constrained Emphatic Temporal-Difference Learning with Constant and Slowly Diminishing Stepsize Huizhen Yu
COLT 2015 On Convergence of Emphatic Temporal-Difference Learning Huizhen Yu
ICML 2010 Convergence of Least Squares Temporal Difference Methods Under General Conditions Huizhen Yu
UAI 2005 A Function Approximation Approach to Estimation of Policy Gradient for POMDP with Structured Policies Huizhen Yu
UAI 2004 Discretized Approximations for POMDP with Average Cost Huizhen Yu, Dimitri P. Bertsekas