Kumar, P. R.

12 publications

NeurIPS 2024 Is O(log N) Practical? Near-Equivalence Between Delay Robustness and Bounded Regret in Bandits and RL Enoch H. Kang, P. R. Kumar

AISTATS 2024 Provable Policy Gradient Methods for Average-Reward Markov Potential Games Min Cheng, Ruida Zhou, P. R. Kumar, Chao Tian

NeurIPS 2023 Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation Ruida Zhou, Tao Liu, Min Cheng, Dileep Kalathil, P. R. Kumar, Chao Tian

NeurIPS 2023 Provably Fast Convergence of Independent Natural Policy Gradient for Markov Potential Games Youbang Sun, Tao Liu, Ruida Zhou, P. R. Kumar, Shahin Shahrampour

NeurIPS 2022 Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning Ruida Zhou, Tao Liu, Dileep Kalathil, P. R. Kumar, Chao Tian

NeurIPS 2022 Augmented RBMLE-UCB Approach for Adaptive Control of Linear Quadratic Systems Akshay Mete, Rahul Singh, P. R. Kumar

NeurIPS 2022 Learning from Few Samples: Transformation-Invariant SVMs with Composition and Locality at Multiple Scales Tao Liu, P. R. Kumar, Ruida Zhou, Xi Liu

L4DC 2021 Reward Biased Maximum Likelihood Estimation for Reinforcement Learning Akshay Mete, Rahul Singh, Xi Liu, P. R. Kumar

AAAI 2021 Reward-Biased Maximum Likelihood Estimation for Linear Stochastic Bandits Yu-Heng Hung, Ping-Chun Hsieh, Xi Liu, P. R. Kumar

ICML 2019 Stay with Me: Lifetime Maximization Through Heteroscedastic Linear Bandits with Reneging Ping-Chun Hsieh, Xi Liu, Anirban Bhattacharya, P. R. Kumar

COLT 1992 Learning Stochastic Functions by Smooth Simultaneous Estimation Kevin Buescher, P. R. Kumar

COLT 1991 Simultaneous Learning of Concepts and Simultaneous Estimation of Probabilities Kevin Buescher, P. R. Kumar