Baxter, Jonathan

21 publications

JMLR 2004 Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning Evan Greensmith, Peter L. Bartlett, Jonathan Baxter
ICML 2002 Scalable Internal-State Policy-Gradient Methods for POMDPs Douglas Aberdeen, Jonathan Baxter
ICML 2001 A Multi-Agent Policy-Gradient Approach to Network Routing Nigel Tao, Jonathan Baxter, Lex Weaver
JAIR 2001 Experiments with Infinite-Horizon, Policy-Gradient Estimation Jonathan Baxter, Peter L. Bartlett, Lex Weaver
JAIR 2001 Infinite-Horizon Policy-Gradient Estimation Jonathan Baxter, Peter L. Bartlett
NeurIPS 2001 Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning Evan Greensmith, Peter L. Bartlett, Jonathan Baxter
JAIR 2000 A Model of Inductive Bias Learning Jonathan Baxter
COLT 2000 Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning Peter L. Bartlett, Jonathan Baxter
MLJ 2000 Improved Generalization Through Explicit Optimization of Margins Llew Mason, Peter L. Bartlett, Jonathan Baxter
MLJ 2000 Learning to Play Chess Using Temporal Differences Jonathan Baxter, Andrew Tridgell, Lex Weaver
ICML 2000 Reinforcement Learning in POMDP's via Direct Gradient Ascent Jonathan Baxter, Peter L. Bartlett
NeurIPS 1999 Boosting Algorithms as Gradient Descent Llew Mason, Jonathan Baxter, Peter L. Bartlett, Marcus R. Frean
MLJ 1999 Guest Editors' Introduction Jonathan Baxter, Nicolò Cesa-Bianchi
NeurIPS 1998 Direct Optimization of Margins Improves Generalization in Combined Classifiers Llew Mason, Peter L. Bartlett, Jonathan Baxter
ICML 1998 KnightCap: A Chess Programm That Learns by Combining TD(lambda) with Game-Tree Search Jonathan Baxter, Andrew Tridgell, Lex Weaver
MLJ 1997 A Bayesian/Information Theoretic Model of Learning to Learn via Multiple Task Sampling Jonathan Baxter
ICML 1997 The Canonical Distortion Measure for Vector Quantization and Function Approximation Jonathan Baxter
NeurIPS 1997 The Canonical Distortion Measure in Feature Space and 1-NN Classification Jonathan Baxter, Peter L. Bartlett
COLT 1996 A Bayesian/Information Theoretic Model of Bias Learning Jonathan Baxter
COLT 1995 Learning Internal Representations Jonathan Baxter
NeurIPS 1995 Learning Model Bias Jonathan Baxter