Patil, Gandharv

5 publications

ICLR 2026. Robust Reward Modeling via Causal Rubrics. Pragya Srivastava, Harman Singh, Rahul Madhavan, Gandharv Patil, Sravanti Addepalli, Arun Suggala, Rengarajan Aravamudhan, Soumya Sharma, Anirban Laha, Aravindan Raghuveer, Karthikeyan Shanmugam, Doina Precup
AISTATS 2024. On Learning History-Based Policies for Controlling Markov Decision Processes. Gandharv Patil, Aditya Mahajan, Doina Precup
AISTATS 2023. Finite Time Analysis of Temporal Difference Learning with Linear Function Approximation: Tail Averaging and Regularisation. Gandharv Patil, Prashanth L.A., Dheeraj Nagaraj, Doina Precup
ICMLW 2023. On Learning History-Based Policies for Controlling Markov Decision Processes. Gandharv Patil, Aditya Mahajan, Doina Precup
AAAI 2021. Variance Penalized On-Policy and Off-Policy Actor-Critic. Arushi Jain, Gandharv Patil, Ayush Jain, Khimya Khetarpal, Doina Precup