Stone, Peter
188 publications
CoRL
2025
ComposableNav: Instruction-Following Navigation in Dynamic Environments via Composable Diffusion
NeurIPS
2025
Evaluating Generalization Capabilities of LLM-Based Agents in Mixed-Motive Scenarios Using Concordia
AAAI
2025
The Essentials of AI for Life and Society: An AI Literacy Course for the University Community
NeurIPS
2024
Discovering Creative Behaviors Through DUPLEX: Diverse Universal Features for Policy Exploration
NeurIPS
2024
Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning
ICLR
2024
Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks
CoLLAs
2024
T-DGR: A Trajectory-Based Deep Generative Replay Method for Continual Learning in Decision Making
ICLR
2023
MACTA: A Multi-Agent Reinforcement Learning Approach for Cache Timing Attacks and Detection
CoRL
2023
STERLING: Self-Supervised Terrain Representation Learning from Unconstrained Robot Experience
NeurIPSW
2023
T-DGR: A Trajectory-Based Deep Generative Replay Method for Continual Learning in Decision Making
IJCAI
2020
Balancing Individual Preferences and Shared Objectives in Multiagent Reinforcement Learning
JAIR
2020
Jointly Improving Parsing and Perception for Natural Language Commands Through Human-Robot Dialog
AAAI
2015
CORPP: Commonsense Reasoning and Probabilistic Planning, as Applied to Dialog with a Mobile Robot
AAAI
2015
Cooperating with Unknown Teammates in Complex Domains: A Robot Soccer Case Study of Ad Hoc Teamwork
AAAI
2015
SCRAM: Scalable Collision-Avoiding Role Assignment with Minimal-Makespan for Formational Positioning
AAAI
2015
UT Austin Villa 2014: RoboCup 3D Simulation League Champion via Overlapping Layered Learning
IJCAI
2015
When Security Games Go Green: Designing Defender Strategies to Prevent Poaching and Illegal Fishing
ECML-PKDD
2013
Model-Selection for Non-Parametric Function Approximation in Continuous Control Problems: A Case Study in a Smart Energy System
ICML
2011
Structure Learning in Ergodic Factored MDPs Without Knowledge of the Transition Function's In-Degree
ECML-PKDD
2010
Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-like Exploration
AAAI
2007
Temporal Difference and Policy Search Methods for Reinforcement Learning: An Empirical Comparison