ML Anthology
Authors
Search
About
Barto, Andrew G.
47 publications
AAAI
2012
Adaptive Step-Size for Online Temporal Difference Learning
William Dabney
,
Andrew G. Barto
ICML
2012
Learning Parameterized Skills
Bruno Castro da Silva
,
George Dimitri Konidaris
,
Andrew G. Barto
AAAI
2012
TD-DeltaPi: A Model-Free Algorithm for Efficient Exploration
Bruno Castro da Silva
,
Andrew G. Barto
AAAI
2011
Autonomous Skill Acquisition on a Mobile Manipulator
George Dimitri Konidaris
,
Scott Kuindersma
,
Roderic A. Grupen
,
Andrew G. Barto
NeurIPS
2011
Clustering via Dirichlet Process Mixture Models for Portable Skill Discovery
Scott Niekum
,
Andrew G. Barto
ICML
2011
Conjugate Markov Decision Processes
Philip S. Thomas
,
Andrew G. Barto
NeurIPS
2010
Constructing Skill Trees for Reinforcement Learning Agents from Demonstration Trajectories
George Konidaris
,
Scott Kuindersma
,
Roderic Grupen
,
Andrew G. Barto
IJCAI
2009
Efficient Skill Learning Using Abstraction Selection
George Dimitri Konidaris
,
Andrew G. Barto
NeurIPS
2009
Skill Discovery in Continuous Reinforcement Learning Domains Using Skill Chaining
George Konidaris
,
Andrew G. Barto
NeurIPS
2008
Skill Characterization Based on Betweenness
Ozgur Simsek
,
Andrew G. Barto
IJCAI
2007
Building Portable Options: Skill Transfer in Reinforcement Learning
George Dimitri Konidaris
,
Andrew G. Barto
IJCAI
2007
Deictic Option Schemas
Balaraman Ravindran
,
Andrew G. Barto
,
Vimal Mathew
ICML
2006
An Intrinsic Reward Mechanism for Efficient Exploration
Özgür Simsek
,
Andrew G. Barto
ICML
2006
Autonomous Shaping: Knowledge Transfer in Reinforcement Learning
George Dimitri Konidaris
,
Andrew G. Barto
AAAI
2006
Decision Tree Methods for Finding Reusable MDP Homomorphisms
Alicia P. Wolfe
,
Andrew G. Barto
ICML
2005
A Causal Approach to Hierarchical Decomposition of Factored MDPs
Anders Jonsson
,
Andrew G. Barto
ICML
2005
Identifying Useful Subgoals in Reinforcement Learning by Local Graph Partitioning
Özgür Simsek
,
Alicia P. Wolfe
,
Andrew G. Barto
NeurIPS
2004
Intrinsically Motivated Reinforcement Learning
Nuttapong Chentanez
,
Andrew G. Barto
,
Satinder P. Singh
ICML
2004
Using Relative Novelty to Identify Useful Temporal Abstractions in Reinforcement Learning
Özgür Simsek
,
Andrew G. Barto
ICML
2003
Relativized Options: Choosing the Right Transformation
Balaraman Ravindran
,
Andrew G. Barto
IJCAI
2003
SMDP Homomorphisms: An Algebraic Approach to Abstraction in Semi-Markov Decision Processes
Balaraman Ravindran
,
Andrew G. Barto
MLJ
2002
Building a Basic Block Instruction Scheduler with Reinforcement Learning and Rollouts
Amy McGovern
,
J. Eliot B. Moss
,
Andrew G. Barto
JMLR
2002
Lyapunov Design for Safe Reinforcement Learning
Theodore J. Perkins
,
Andrew G. Barto
ICML
2002
PolicyBlocks: An Algorithm for Creating Useful Macro-Actions in Reinforcement Learning
Marc Pickett
,
Andrew G. Barto
ICML
2001
Automatic Discovery of Subgoals in Reinforcement Learning Using Diverse Density
Amy McGovern
,
Andrew G. Barto
IJCAI
2001
Heuristic Search in Infinite State Spaces Guided by Lyapunov Analysis
Theodore J. Perkins
,
Andrew G. Barto
ICML
2001
Lyapunov-Constrained Action Sets for Reinforcement Learning
Theodore J. Perkins
,
Andrew G. Barto
IJCAI
2001
Robot Weightlifting by Direct Policy Search
Michael T. Rosenstein
,
Andrew G. Barto
NeurIPS
2001
The Emergence of Multiple Movement Units in the Presence of Noise and Feedback Delay
Michael Kositsky
,
Andrew G. Barto
NeurIPS
2000
Automated State Abstraction for Options Using the U-Tree Algorithm
Anders Jonsson
,
Andrew G. Barto
ICML
2000
Combining Reinforcement Learning with a Local Control Algorithm
Jette Randløv
,
Andrew G. Barto
,
Michael T. Rosenstein
ICML
2000
Machine Learning for Subproblem Selection
Robert Moll
,
Theodore J. Perkins
,
Andrew G. Barto
NeCo
1999
A Cerebellar Model of Timing and Prediction in the Control of Reaching
Andrew G. Barto
,
Andrew H. Fagg
,
Nathan Sitkoff
,
James C. Houk
MLJ
1998
Elevator Group Control Using Multiple Reinforcement Learning Agents
Robert H. Crites
,
Andrew G. Barto
NeurIPS
1998
Learning Instance-Independent Value Functions to Enhance Local Search
Robert Moll
,
Andrew G. Barto
,
Theodore J. Perkins
,
Richard S. Sutton
NeurIPS
1997
Automated Aircraft Recovery via Reinforcement Learning: Initial Experiments
Jeffrey F. Monaco
,
David G. Ward
,
Andrew G. Barto
MLJ
1996
Linear Least-Squares Algorithms for Temporal Difference Learning
Steven J. Bradtke
,
Andrew G. Barto
NeurIPS
1996
Local Bandit Approximation for Optimal Learning Problems
Michael O. Duff
,
Andrew G. Barto
NeurIPS
1996
Reinforcement Learning for Mixed Open-Loop and Closed-Loop Control
Eric A. Hansen
,
Andrew G. Barto
,
Shlomo Zilberstein
NeurIPS
1996
Text-Based Information Retrieval Using Exponentiated Gradient Descent
Ron Papka
,
James P. Callan
,
Andrew G. Barto
NeurIPS
1995
A Predictive Switching Model of Cerebellar Movement Control
Andrew G. Barto
,
James C. Houk
NeurIPS
1995
Improving Elevator Performance Using Reinforcement Learning
Robert H. Crites
,
Andrew G. Barto
NeurIPS
1994
An Actor/Critic Algorithm That Is Equivalent to Q-Learning
Robert H. Crites
,
Andrew G. Barto
NeurIPS
1993
Convergence of Indirect Adaptive Asynchronous Value Iteration Algorithms
Vijaykumar Gullapalli
,
Andrew G. Barto
NeurIPS
1993
Robust Reinforcement Learning in Motion Planning
Satinder P. Singh
,
Andrew G. Barto
,
Roderic Grupen
,
Christopher Connolly
AAAI
1990
Explaining Temporal Differences to Create Useful Concepts for Evaluating States
Richard C. Yee
,
Sharad Saxena
,
Paul E. Utgoff
,
Andrew G. Barto
IJCAI
1985
Training and Tracking in Robotics
Oliver G. Selfridge
,
Richard S. Sutton
,
Andrew G. Barto