Barto, Andrew G.

47 publications

AAAI 2012 Adaptive Step-Size for Online Temporal Difference Learning William Dabney, Andrew G. Barto
ICML 2012 Learning Parameterized Skills Bruno Castro da Silva, George Dimitri Konidaris, Andrew G. Barto
AAAI 2012 TD-DeltaPi: A Model-Free Algorithm for Efficient Exploration Bruno Castro da Silva, Andrew G. Barto
AAAI 2011 Autonomous Skill Acquisition on a Mobile Manipulator George Dimitri Konidaris, Scott Kuindersma, Roderic A. Grupen, Andrew G. Barto
NeurIPS 2011 Clustering via Dirichlet Process Mixture Models for Portable Skill Discovery Scott Niekum, Andrew G. Barto
ICML 2011 Conjugate Markov Decision Processes Philip S. Thomas, Andrew G. Barto
NeurIPS 2010 Constructing Skill Trees for Reinforcement Learning Agents from Demonstration Trajectories George Konidaris, Scott Kuindersma, Roderic Grupen, Andrew G. Barto
IJCAI 2009 Efficient Skill Learning Using Abstraction Selection George Dimitri Konidaris, Andrew G. Barto
NeurIPS 2009 Skill Discovery in Continuous Reinforcement Learning Domains Using Skill Chaining George Konidaris, Andrew G. Barto
NeurIPS 2008 Skill Characterization Based on Betweenness Ozgur Simsek, Andrew G. Barto
IJCAI 2007 Building Portable Options: Skill Transfer in Reinforcement Learning George Dimitri Konidaris, Andrew G. Barto
IJCAI 2007 Deictic Option Schemas Balaraman Ravindran, Andrew G. Barto, Vimal Mathew
ICML 2006 An Intrinsic Reward Mechanism for Efficient Exploration Özgür Simsek, Andrew G. Barto
ICML 2006 Autonomous Shaping: Knowledge Transfer in Reinforcement Learning George Dimitri Konidaris, Andrew G. Barto
AAAI 2006 Decision Tree Methods for Finding Reusable MDP Homomorphisms Alicia P. Wolfe, Andrew G. Barto
ICML 2005 A Causal Approach to Hierarchical Decomposition of Factored MDPs Anders Jonsson, Andrew G. Barto
ICML 2005 Identifying Useful Subgoals in Reinforcement Learning by Local Graph Partitioning Özgür Simsek, Alicia P. Wolfe, Andrew G. Barto
NeurIPS 2004 Intrinsically Motivated Reinforcement Learning Nuttapong Chentanez, Andrew G. Barto, Satinder P. Singh
ICML 2004 Using Relative Novelty to Identify Useful Temporal Abstractions in Reinforcement Learning Özgür Simsek, Andrew G. Barto
ICML 2003 Relativized Options: Choosing the Right Transformation Balaraman Ravindran, Andrew G. Barto
IJCAI 2003 SMDP Homomorphisms: An Algebraic Approach to Abstraction in Semi-Markov Decision Processes Balaraman Ravindran, Andrew G. Barto
MLJ 2002 Building a Basic Block Instruction Scheduler with Reinforcement Learning and Rollouts Amy McGovern, J. Eliot B. Moss, Andrew G. Barto
JMLR 2002 Lyapunov Design for Safe Reinforcement Learning Theodore J. Perkins, Andrew G. Barto
ICML 2002 PolicyBlocks: An Algorithm for Creating Useful Macro-Actions in Reinforcement Learning Marc Pickett, Andrew G. Barto
ICML 2001 Automatic Discovery of Subgoals in Reinforcement Learning Using Diverse Density Amy McGovern, Andrew G. Barto
IJCAI 2001 Heuristic Search in Infinite State Spaces Guided by Lyapunov Analysis Theodore J. Perkins, Andrew G. Barto
ICML 2001 Lyapunov-Constrained Action Sets for Reinforcement Learning Theodore J. Perkins, Andrew G. Barto
IJCAI 2001 Robot Weightlifting by Direct Policy Search Michael T. Rosenstein, Andrew G. Barto
NeurIPS 2001 The Emergence of Multiple Movement Units in the Presence of Noise and Feedback Delay Michael Kositsky, Andrew G. Barto
NeurIPS 2000 Automated State Abstraction for Options Using the U-Tree Algorithm Anders Jonsson, Andrew G. Barto
ICML 2000 Combining Reinforcement Learning with a Local Control Algorithm Jette Randløv, Andrew G. Barto, Michael T. Rosenstein
ICML 2000 Machine Learning for Subproblem Selection Robert Moll, Theodore J. Perkins, Andrew G. Barto
NeCo 1999 A Cerebellar Model of Timing and Prediction in the Control of Reaching Andrew G. Barto, Andrew H. Fagg, Nathan Sitkoff, James C. Houk
MLJ 1998 Elevator Group Control Using Multiple Reinforcement Learning Agents Robert H. Crites, Andrew G. Barto
NeurIPS 1998 Learning Instance-Independent Value Functions to Enhance Local Search Robert Moll, Andrew G. Barto, Theodore J. Perkins, Richard S. Sutton
NeurIPS 1997 Automated Aircraft Recovery via Reinforcement Learning: Initial Experiments Jeffrey F. Monaco, David G. Ward, Andrew G. Barto
MLJ 1996 Linear Least-Squares Algorithms for Temporal Difference Learning Steven J. Bradtke, Andrew G. Barto
NeurIPS 1996 Local Bandit Approximation for Optimal Learning Problems Michael O. Duff, Andrew G. Barto
NeurIPS 1996 Reinforcement Learning for Mixed Open-Loop and Closed-Loop Control Eric A. Hansen, Andrew G. Barto, Shlomo Zilberstein
NeurIPS 1996 Text-Based Information Retrieval Using Exponentiated Gradient Descent Ron Papka, James P. Callan, Andrew G. Barto
NeurIPS 1995 A Predictive Switching Model of Cerebellar Movement Control Andrew G. Barto, James C. Houk
NeurIPS 1995 Improving Elevator Performance Using Reinforcement Learning Robert H. Crites, Andrew G. Barto
NeurIPS 1994 An Actor/Critic Algorithm That Is Equivalent to Q-Learning Robert H. Crites, Andrew G. Barto
NeurIPS 1993 Convergence of Indirect Adaptive Asynchronous Value Iteration Algorithms Vijaykumar Gullapalli, Andrew G. Barto
NeurIPS 1993 Robust Reinforcement Learning in Motion Planning Satinder P. Singh, Andrew G. Barto, Roderic Grupen, Christopher Connolly
AAAI 1990 Explaining Temporal Differences to Create Useful Concepts for Evaluating States Richard C. Yee, Sharad Saxena, Paul E. Utgoff, Andrew G. Barto
IJCAI 1985 Training and Tracking in Robotics Oliver G. Selfridge, Richard S. Sutton, Andrew G. Barto