Jain, Rahul

24 publications

AISTATS 2025 A Safe Bayesian Learning Algorithm for Constrained MDPs with Bounded Constraint Violation Krishna C Kalagarla, Rahul Jain, Pierluigi Nuzzo
L4DC 2025 Conditional Kernel Imitation Learning for Continuous State Environments Rishabh Agrawal, Nathan Dahlin, Rahul Jain, Ashutosh Nayyar
AAAI 2025 Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning Rishabh Agrawal, Nathan Dahlin, Rahul Jain, Ashutosh Nayyar
NeurIPS 2025 Robust LLM Alignment via Distributionally Robust Direct Preference Optimization Zaiyan Xu, Sushil Vemuri, Kishan Panaganti, Dileep Kalathil, Rahul Jain, Deepak Ramachandran
ICML 2024 ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints Akhil Agnihotri, Rahul Jain, Haipeng Luo
NeurIPS 2024 E-COP: Episodic Constrained Optimization of Policies Akhil Agnihotri, Rahul Jain, Deepak Ramachandran, Sahil Singla
NeurIPSW 2024 Policy Optimization for Strictly Batch Imitation Learning Rishabh Agrawal, Nathan Dahlin, Rahul Jain, Ashutosh Nayyar
NeurIPSW 2023 Average-Constrained Policy Optimization Akhil Agnihotri, Rahul Jain, Haipeng Luo
TMLR 2023 Bridging Imitation and Online Reinforcement Learning: An Optimistic Tale Botao Hao, Rahul Jain, Dengwang Tang, Zheng Wen
ICML 2023 Leveraging Demonstrations to Improve Online Learning: Quality Matters Botao Hao, Rahul Jain, Tor Lattimore, Benjamin Van Roy, Zheng Wen
UAI 2023 Posterior Sampling-Based Online Learning for the Stochastic Shortest Path Model Mehdi Jafarnia-Jahromi, Liyu Chen, Rahul Jain, Haipeng Luo
NeurIPSW 2023 Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation Krishna C Kalagarla, Rahul Jain, Pierluigi Nuzzo
AISTATS 2022 Online Learning for Unknown Partially Observable MDPs Mehdi Jafarnia-Jahromi, Rahul Jain, Ashutosh Nayyar
ICML 2022 Improved No-Regret Algorithms for Stochastic Shortest Path with Linear MDP Liyu Chen, Rahul Jain, Haipeng Luo
ICML 2022 Learning Infinite-Horizon Average-Reward Markov Decision Process with Constraints Liyu Chen, Rahul Jain, Haipeng Luo
NeurIPSW 2022 Learning Neuro-Symbolic Programs for Language-Guided Robotic Manipulation Namasivayam Kalithasan, Himanshu Gaurav Singh, Vishal Bindal, Arnav Tuli, Vishwajeet Agrawal, Rahul Jain, Parag Singla, Rohan Paul
NeurIPS 2022 Matrix Multiplicative Weights Updates in Quantum Zero-Sum Games: Conservation Laws & Recurrence Rahul Jain, Georgios Piliouras, Ryann Sim
UAI 2022 Optimal Control of Partially Observable Markov Decision Processes with Finite Linear Temporal Logic Constraints Krishna C. Kalagarla, Kartik Dhruva, Dongming Shen, Rahul Jain, Ashutosh Nayyar, Pierluigi Nuzzo
AISTATS 2021 Learning Infinite-Horizon Average-Reward MDPs with Linear Function Approximation Chen-Yu Wei, Mehdi Jafarnia-Jahromi, Haipeng Luo, Rahul Jain
AAAI 2021 A Sample-Efficient Algorithm for Episodic Finite-Horizon MDP with Constraints Krishna Chaitanya Kalagarla, Rahul Jain, Pierluigi Nuzzo
NeurIPS 2021 Implicit Finite-Horizon Approximation and Efficient Optimal Algorithms for Stochastic Shortest Path Liyu Chen, Mehdi Jafarnia-Jahromi, Rahul Jain, Haipeng Luo
ICML 2020 Model-Free Reinforcement Learning in Infinite-Horizon Average-Reward Markov Decision Processes Chen-Yu Wei, Mehdi Jafarnia-Jahromi, Haipeng Luo, Hiteshi Sharma, Rahul Jain
UAI 2019 Approximate Relative Value Learning for Average-Reward Continuous State MDPs Hiteshi Sharma, Mehdi Jafarnia-Jahromi, Rahul Jain
NeurIPS 2017 Learning Unknown Markov Decision Processes: A Thompson Sampling Approach Yi Ouyang, Mukul Gagrani, Ashutosh Nayyar, Rahul Jain