Mahmood, Ashique Rupam

3 publications

UAI 2015 Off-Policy Learning Based on Weighted Importance Sampling with Linear Computational Complexity Ashique Rupam Mahmood, Richard S. Sutton
ICML 2014 A New Q(lambda) with Interim Forward View and Monte Carlo Equivalence Rich Sutton, Ashique Rupam Mahmood, Doina Precup, Hado Hasselt
UAI 2014 Off-Policy TD( L) with a True Online Equivalence Hado van Hasselt, Ashique Rupam Mahmood, Richard S. Sutton