Deepak, H R

1 publication

TMLR 2025. Variance Reduced Smoothed Functional REINFORCE Policy Gradient Algorithms. Shalabh Bhatnagar, H R Deepak.