Mondal, Washim U.

1 publications

AISTATS 2024 Improved Sample Complexity Analysis of Natural Policy Gradient Algorithm with General Parameterization for Infinite Horizon Discounted Reward Markov Decision Processes Washim U. Mondal, Vaneet Aggarwal