ML Anthology
Swamy, Gokul
29 publications
NeurIPS
2025
A Smooth Sea Never Made a Skilled SAILOR: Robust Imitation via Learning to Search
Arnav Kumar Jain, Vibhakar Mohta, Subin Kim, Atiksh Bhardwaj, Juntao Ren, Yunhai Feng, Sanjiban Choudhury, Gokul Swamy
ICLR
2025
Diffusing States and Matching Scores: A New Framework for Imitation Learning
Runzhe Wu, Yiding Chen, Gokul Swamy, Kianté Brantley, Wen Sun
ICLR
2025
Efficient Imitation Under Misspecification
Nicolas Espinosa-Dice, Sanjiban Choudhury, Wen Sun, Gokul Swamy
ICLRW
2025
From Foresight to Forethought: VLM-in-the-Loop Policy Steering via Latent Alignment
Yilin Wu, Ran Tian, Gokul Swamy, Andrea Bajcsy
ICLR
2025
Regressing the Relative Future: Efficient Policy Optimization for Multi-Turn RLHF
Zhaolin Gao, Wenhao Zhan, Jonathan Daniel Chang, Gokul Swamy, Kianté Brantley, Jason D. Lee, Wen Sun
NeurIPS
2025
Scaling Offline RL via Efficient and Expressive Shortcut Models
Nicolas Espinosa-Dice, Yiyi Zhang, Yiding Chen, Bradley Guo, Owen Oertell, Gokul Swamy, Kianté Brantley, Wen Sun
ICML
2024
A Minimaximalist Approach to Reinforcement Learning from Human Feedback
Gokul Swamy, Christoph Dann, Rahul Kidambi, Steven Wu, Alekh Agarwal
ICMLW
2024
Efficient Inverse Reinforcement Learning Without Compounding Errors
Nicolas Espinosa Dice, Gokul Swamy, Sanjiban Choudhury, Wen Sun
ICML
2024
EvIL: Evolution Strategies for Generalisable Imitation Learning
Silvia Sapora, Gokul Swamy, Chris Lu, Yee Whye Teh, Jakob Nicolaus Foerster
ICML
2024
Hybrid Inverse Reinforcement Learning
Juntao Ren, Gokul Swamy, Steven Wu, Drew Bagnell, Sanjiban Choudhury
NeurIPS
2024
Multi-Agent Imitation Learning: Value Is Easy, Regret Is Hard
Jingwu Tang, Gokul Swamy, Fei Fang, Zhiwei Steven Wu
ICMLW
2024
Multi-Agent Imitation Learning: Value Is Easy, Regret Is Hard
Jingwu Tang, Gokul Swamy, Fei Fang, Steven Wu
ICMLW
2024
Multi-Agent Imitation Learning: Value Is Easy, Regret Is Hard
Jingwu Tang, Gokul Swamy, Fei Fang, Steven Wu
NeurIPS
2024
REBEL: Reinforcement Learning via Regressing Relative Rewards
Zhaolin Gao, Jonathan D. Chang, Wenhao Zhan, Owen Oertell, Gokul Swamy, Kianté Brantley, Thorsten Joachims, J. Andrew Bagnell, Jason D. Lee, Wen Sun
ICMLW
2024
REBEL: Reinforcement Learning via Regressing Relative Rewards
Zhaolin Gao, Jonathan Daniel Chang, Wenhao Zhan, Owen Oertell, Gokul Swamy, Kianté Brantley, Thorsten Joachims, J. Andrew Bagnell, Jason D. Lee, Wen Sun
ICMLW
2024
REBEL: Reinforcement Learning via Regressing Relative Rewards
Zhaolin Gao, Jonathan Daniel Chang, Wenhao Zhan, Owen Oertell, Gokul Swamy, Kianté Brantley, Thorsten Joachims, J. Andrew Bagnell, Jason D. Lee, Wen Sun
NeurIPS
2024
The Importance of Online Data: Understanding Preference Fine-Tuning via Coverage
Yuda Song, Gokul Swamy, Aarti Singh, J. Andrew Bagnell, Wen Sun
ICMLW
2024
The Importance of Online Data: Understanding Preference Fine-Tuning via Coverage
Yuda Song, Gokul Swamy, Aarti Singh, Drew Bagnell, Wen Sun
ICML
2024
When Is Transfer Learning Possible?
My Phan, Kianté Brantley, Stephanie Milani, Soroush Mehri, Gokul Swamy, Geoffrey J. Gordon
ICMLW
2023
Complementing a Policy with a Different Observation Space
Gokul Swamy, Sanjiban Choudhury, Drew Bagnell, Steven Wu
ICML
2023
Inverse Reinforcement Learning Without Reinforcement Learning
Gokul Swamy, David Wu, Sanjiban Choudhury, Drew Bagnell, Steven Wu
NeurIPS
2023
Learning Shared Safety Constraints from Multi-Task Demonstrations
Konwoo Kim, Gokul Swamy, Zuxin Liu, Ding Zhao, Sanjiban Choudhury, Steven Z. Wu
ICMLW
2023
Learning Shared Safety Constraints from Multi-Task Demonstrations
Konwoo Kim, Gokul Swamy, Zuxin Liu, Ding Zhao, Sanjiban Choudhury, Steven Wu
ICMLW
2023
Learning Shared Safety Constraints from Multi-Task Demonstrations
Konwoo Kim, Gokul Swamy, Zuxin Liu, Ding Zhao, Sanjiban Choudhury, Steven Wu
ICML
2022
Causal Imitation Learning Under Temporally Correlated Noise
Gokul Swamy, Sanjiban Choudhury, Drew Bagnell, Steven Wu
NeurIPS
2022
Minimax Optimal Online Imitation Learning via Replay Estimation
Gokul Swamy, Nived Rajaraman, Matt Peng, Sanjiban Choudhury, J. A. Bagnell, Steven Z. Wu, Jiantao Jiao, Kannan Ramchandran
NeurIPS
2022
Sequence Model Imitation Learning with Unobserved Contexts
Gokul Swamy, Sanjiban Choudhury, J. A. Bagnell, Steven Z. Wu
ICML
2021
Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation Gap
Gokul Swamy, Sanjiban Choudhury, J. Andrew Bagnell, Steven Wu
NeurIPSW
2021
What Would the Expert $do(\cdot)$?: Causal Imitation Learning
Gokul Swamy, Sanjiban Choudhury, Drew Bagnell, Steven Wu