Gummadi, Ramki

7 publications

AISTATS 2024 Feasible Q-Learning for Average Reward Reinforcement Learning Ying Jin, Ramki Gummadi, Zhengyuan Zhou, Jose Blanchet
ICML 2024 Target Networks and Over-Parameterization Stabilize Off-Policy Bootstrapping with Function Approximation Fengdi Che, Chenjun Xiao, Jincheng Mei, Bo Dai, Ramki Gummadi, Oscar A Ramirez, Christopher K Harris, A. Rupam Mahmood, Dale Schuurmans
ICML 2022 A Parametric Class of Approximate Gradient Updates for Policy Optimization Ramki Gummadi, Saurabh Kumar, Junfeng Wen, Dale Schuurmans
ICLR 2022 Understanding and Leveraging Overparameterization in Recursive Value Estimation Chenjun Xiao, Bo Dai, Jincheng Mei, Oscar A Ramirez, Ramki Gummadi, Chris Harris, Dale Schuurmans
ICML 2021 Characterizing the Gap Between Actor-Critic and Policy Gradient Junfeng Wen, Saurabh Kumar, Ramki Gummadi, Dale Schuurmans
NeurIPS 2019 Surrogate Objectives for Batch Policy Optimization in One-Step Decision Making Minmin Chen, Ramki Gummadi, Chris Harris, Dale Schuurmans
AISTATS 2018 Variational Rejection Sampling Aditya Grover, Ramki Gummadi, Miguel Lázaro-Gredilla, Dale Schuurmans, Stefano Ermon