Kumar, P. R.

12 publications

NeurIPS 2024 Is O(log N) Practical? Near-Equivalence Between Delay Robustness and Bounded Regret in Bandits and RL Enoch H. Kang, P. R. Kumar
AISTATS 2024 Provable Policy Gradient Methods for Average-Reward Markov Potential Games Min Cheng, Ruida Zhou, P. R. Kumar, Chao Tian
NeurIPS 2023 Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation Ruida Zhou, Tao Liu, Min Cheng, Dileep Kalathil, P. R. Kumar, Chao Tian
NeurIPS 2023 Provably Fast Convergence of Independent Natural Policy Gradient for Markov Potential Games Youbang Sun, Tao Liu, Ruida Zhou, P. R. Kumar, Shahin Shahrampour
NeurIPS 2022 Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning Ruida Zhou, Tao Liu, Dileep Kalathil, P. R. Kumar, Chao Tian
NeurIPS 2022 Augmented RBMLE-UCB Approach for Adaptive Control of Linear Quadratic Systems Akshay Mete, Rahul Singh, P. R. Kumar
NeurIPS 2022 Learning from Few Samples: Transformation-Invariant SVMs with Composition and Locality at Multiple Scales Tao Liu, P. R. Kumar, Ruida Zhou, Xi Liu
L4DC 2021 Reward Biased Maximum Likelihood Estimation for Reinforcement Learning Akshay Mete, Rahul Singh, Xi Liu, P. R. Kumar
AAAI 2021 Reward-Biased Maximum Likelihood Estimation for Linear Stochastic Bandits Yu-Heng Hung, Ping-Chun Hsieh, Xi Liu, P. R. Kumar
ICML 2019 Stay with Me: Lifetime Maximization Through Heteroscedastic Linear Bandits with Reneging Ping-Chun Hsieh, Xi Liu, Anirban Bhattacharya, P R Kumar
COLT 1992 Learning Stochastic Functions by Smooth Simultaneous Estimation Kevin Buescher, P. R. Kumar
COLT 1991 Simultaneous Learning of Concepts and Simultaneous Estimation of Probabilities Kevin Buescher, P. R. Kumar