Swamy, Gokul

29 publications

NeurIPS 2025 A Smooth Sea Never Made a Skilled SAILOR: Robust Imitation via Learning to Search Arnav Kumar Jain, Vibhakar Mohta, Subin Kim, Atiksh Bhardwaj, Juntao Ren, Yunhai Feng, Sanjiban Choudhury, Gokul Swamy
ICLR 2025 Diffusing States and Matching Scores: A New Framework for Imitation Learning Runzhe Wu, Yiding Chen, Gokul Swamy, Kianté Brantley, Wen Sun
ICLR 2025 Efficient Imitation Under Misspecification Nicolas Espinosa-Dice, Sanjiban Choudhury, Wen Sun, Gokul Swamy
ICLRW 2025 From Foresight to Forethought: VLM-in-the-Loop Policy Steering via Latent Alignment Yilin Wu, Ran Tian, Gokul Swamy, Andrea Bajcsy
ICLR 2025 Regressing the Relative Future: Efficient Policy Optimization for Multi-Turn RLHF Zhaolin Gao, Wenhao Zhan, Jonathan Daniel Chang, Gokul Swamy, Kianté Brantley, Jason D. Lee, Wen Sun
NeurIPS 2025 Scaling Offline RL via Efficient and Expressive Shortcut Models Nicolas Espinosa-Dice, Yiyi Zhang, Yiding Chen, Bradley Guo, Owen Oertell, Gokul Swamy, Kianté Brantley, Wen Sun
ICML 2024 A Minimaximalist Approach to Reinforcement Learning from Human Feedback Gokul Swamy, Christoph Dann, Rahul Kidambi, Steven Wu, Alekh Agarwal
ICMLW 2024 Efficient Inverse Reinforcement Learning Without Compounding Errors Nicolas Espinosa Dice, Gokul Swamy, Sanjiban Choudhury, Wen Sun
ICML 2024 EvIL: Evolution Strategies for Generalisable Imitation Learning Silvia Sapora, Gokul Swamy, Chris Lu, Yee Whye Teh, Jakob Nicolaus Foerster
ICML 2024 Hybrid Inverse Reinforcement Learning Juntao Ren, Gokul Swamy, Steven Wu, Drew Bagnell, Sanjiban Choudhury
NeurIPS 2024 Multi-Agent Imitation Learning: Value Is Easy, Regret Is Hard Jingwu Tang, Gokul Swamy, Fei Fang, Zhiwei Steven Wu
ICMLW 2024 Multi-Agent Imitation Learning: Value Is Easy, Regret Is Hard Jingwu Tang, Gokul Swamy, Fei Fang, Steven Wu
ICMLW 2024 Multi-Agent Imitation Learning: Value Is Easy, Regret Is Hard Jingwu Tang, Gokul Swamy, Fei Fang, Steven Wu
NeurIPS 2024 REBEL: Reinforcement Learning via Regressing Relative Rewards Zhaolin Gao, Jonathan D. Chang, Wenhao Zhan, Owen Oertell, Gokul Swamy, Kianté Brantley, Thorsten Joachims, J. Andrew Bagnell, Jason D. Lee, Wen Sun
ICMLW 2024 REBEL: Reinforcement Learning via Regressing Relative Rewards Zhaolin Gao, Jonathan Daniel Chang, Wenhao Zhan, Owen Oertell, Gokul Swamy, Kianté Brantley, Thorsten Joachims, J. Andrew Bagnell, Jason D. Lee, Wen Sun
ICMLW 2024 REBEL: Reinforcement Learning via Regressing Relative Rewards Zhaolin Gao, Jonathan Daniel Chang, Wenhao Zhan, Owen Oertell, Gokul Swamy, Kianté Brantley, Thorsten Joachims, J. Andrew Bagnell, Jason D. Lee, Wen Sun
NeurIPS 2024 The Importance of Online Data: Understanding Preference Fine-Tuning via Coverage Yuda Song, Gokul Swamy, Aarti Singh, J. Andrew Bagnell, Wen Sun
ICMLW 2024 The Importance of Online Data: Understanding Preference Fine-Tuning via Coverage Yuda Song, Gokul Swamy, Aarti Singh, Drew Bagnell, Wen Sun
ICML 2024 When Is Transfer Learning Possible? My Phan, Kianté Brantley, Stephanie Milani, Soroush Mehri, Gokul Swamy, Geoffrey J. Gordon
ICMLW 2023 Complementing a Policy with a Different Observation Space Gokul Swamy, Sanjiban Choudhury, Drew Bagnell, Steven Wu
ICML 2023 Inverse Reinforcement Learning Without Reinforcement Learning Gokul Swamy, David Wu, Sanjiban Choudhury, Drew Bagnell, Steven Wu
NeurIPS 2023 Learning Shared Safety Constraints from Multi-Task Demonstrations Konwoo Kim, Gokul Swamy, Zuxin Liu, Ding Zhao, Sanjiban Choudhury, Steven Z. Wu
ICMLW 2023 Learning Shared Safety Constraints from Multi-Task Demonstrations Konwoo Kim, Gokul Swamy, Zuxin Liu, Ding Zhao, Sanjiban Choudhury, Steven Wu
ICMLW 2023 Learning Shared Safety Constraints from Multi-Task Demonstrations Konwoo Kim, Gokul Swamy, Zuxin Liu, Ding Zhao, Sanjiban Choudhury, Steven Wu
ICML 2022 Causal Imitation Learning Under Temporally Correlated Noise Gokul Swamy, Sanjiban Choudhury, Drew Bagnell, Steven Wu
NeurIPS 2022 Minimax Optimal Online Imitation Learning via Replay Estimation Gokul Swamy, Nived Rajaraman, Matt Peng, Sanjiban Choudhury, J. A. Bagnell, Steven Z. Wu, Jiantao Jiao, Kannan Ramchandran
NeurIPS 2022 Sequence Model Imitation Learning with Unobserved Contexts Gokul Swamy, Sanjiban Choudhury, J. A. Bagnell, Steven Z. Wu
ICML 2021 Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation Gap Gokul Swamy, Sanjiban Choudhury, J. Andrew Bagnell, Steven Wu
NeurIPSW 2021 What Would the Expert $do(\cdot)$?: Causal Imitation Learning Gokul Swamy, Sanjiban Choudhury, Drew Bagnell, Steven Wu