Kumar, P. R.
12 publications
NeurIPS
2024
Is O(log N) Practical? Near-Equivalence Between Delay Robustness and Bounded Regret in Bandits and RL
NeurIPS
2023
Provably Fast Convergence of Independent Natural Policy Gradient for Markov Potential Games
NeurIPS
2022
Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning