ML Anthology
Baxter, Jonathan
21 publications
JMLR
2004
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Evan Greensmith, Peter L. Bartlett, Jonathan Baxter
ICML
2002
Scalable Internal-State Policy-Gradient Methods for POMDPs
Douglas Aberdeen, Jonathan Baxter
ICML
2001
A Multi-Agent Policy-Gradient Approach to Network Routing
Nigel Tao, Jonathan Baxter, Lex Weaver
JAIR
2001
Experiments with Infinite-Horizon, Policy-Gradient Estimation
Jonathan Baxter, Peter L. Bartlett, Lex Weaver
JAIR
2001
Infinite-Horizon Policy-Gradient Estimation
Jonathan Baxter, Peter L. Bartlett
NeurIPS
2001
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Evan Greensmith, Peter L. Bartlett, Jonathan Baxter
JAIR
2000
A Model of Inductive Bias Learning
Jonathan Baxter
COLT
2000
Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning
Peter L. Bartlett, Jonathan Baxter
MLJ
2000
Improved Generalization Through Explicit Optimization of Margins
Llew Mason, Peter L. Bartlett, Jonathan Baxter
MLJ
2000
Learning to Play Chess Using Temporal Differences
Jonathan Baxter, Andrew Tridgell, Lex Weaver
ICML
2000
Reinforcement Learning in POMDP's via Direct Gradient Ascent
Jonathan Baxter, Peter L. Bartlett
NeurIPS
1999
Boosting Algorithms as Gradient Descent
Llew Mason, Jonathan Baxter, Peter L. Bartlett, Marcus R. Frean
MLJ
1999
Guest Editors' Introduction
Jonathan Baxter, Nicolò Cesa-Bianchi
NeurIPS
1998
Direct Optimization of Margins Improves Generalization in Combined Classifiers
Llew Mason, Peter L. Bartlett, Jonathan Baxter
ICML
1998
KnightCap: A Chess Program That Learns by Combining TD(lambda) with Game-Tree Search
Jonathan Baxter, Andrew Tridgell, Lex Weaver
MLJ
1997
A Bayesian/Information Theoretic Model of Learning to Learn via Multiple Task Sampling
Jonathan Baxter
ICML
1997
The Canonical Distortion Measure for Vector Quantization and Function Approximation
Jonathan Baxter
NeurIPS
1997
The Canonical Distortion Measure in Feature Space and 1-NN Classification
Jonathan Baxter, Peter L. Bartlett
COLT
1996
A Bayesian/Information Theoretic Model of Bias Learning
Jonathan Baxter
COLT
1995
Learning Internal Representations
Jonathan Baxter
NeurIPS
1995
Learning Model Bias
Jonathan Baxter