Bhatnagar, Shalabh
21 publications
NeurIPS
2022
Model-Based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algorithm
MLJ
2018
An Incremental Off-Policy Search in a Model-Free Markov Decision Process Using a Single Sample Path