Panda, Prashansa

2 publications

AAAI 2025 Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation. Prashansa Panda, Shalabh Bhatnagar
UAI 2024 Finite-Time Analysis of Three-Timescale Constrained Actor-Critic and Constrained Natural Actor-Critic Algorithms. Prashansa Panda, Shalabh Bhatnagar