Abbeel, Pieter
281 publications
NeurIPS
2025
Bigger, Regularized, Categorical: High-Capacity Value Functions Are Efficient Multi-Task Learners
NeurIPS
2025
Coarse-to-Fine Q-Network with Action Sequence for Data-Efficient Reinforcement Learning
ICLR
2025
MaxInfoRL: Boosting Exploration in Reinforcement Learning Through Information Gain Maximization
ICLR
2025
SEMDICE: Off-Policy State Entropy Maximization via Stationary Distribution Correction Estimation
CoRL
2025
The Sound of Simulation: Learning Multimodal Sim-to-Real Robot Policies with Generative Audio
ICMLW
2024
Compressing the Latent Space of Single-Sequence Protein Predictors for Multimodal Generation
CoRL
2024
Learning Robotic Locomotion Affordances and Photorealistic Simulators from Human-Captured Data
CoRL
2024
Reinforcement Learning with Foundation Priors: Let Embodied Agent Efficiently Learn on Its Own
NeurIPSW
2023
What Can AI Learn from Human Exploration? Intrinsically-Motivated Humans and Agents in Open-World Exploration
ICLRW
2022
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning
ICML
2022
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents
NeurIPSW
2021
Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL
NeurIPS
2021
Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings
NeurIPSW
2021
SURF: Semi-Supervised Reward Learning with Data Augmentation for Feedback-Efficient Preference-Based Reinforcement Learning
NeurIPS
2020
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
NeurIPS
2020
Trajectory-Wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning
ICML
2019
Bit-Swap: Recursive Bits-Back Coding for Lossless Compression with Hierarchical Latent Variables
NeurIPS
2019
MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies
CoRL
2018
Composable Action-Conditioned Predictors: Flexible Off-Policy Learning for Robot Navigation
ICML
2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
ICML
2018
Universal Planning Networks: Learning Generalizable Representations for Visuomotor Control