ML Anthology
Authors
Search
About
Weaver, Lex
5 publications
ICML
2001
A Multi-Agent Policy-Gradient Approach to Network Routing
Nigel Tao
,
Jonathan Baxter
,
Lex Weaver
JAIR
2001
Experiments with Infinite-Horizon, Policy-Gradient Estimation
Jonathan Baxter
,
Peter L. Bartlett
,
Lex Weaver
UAI
2001
The Optimal Reward Baseline for Gradient-Based Reinforcement Learning
Lex Weaver
,
Nigel Tao
MLJ
2000
Learning to Play Chess Using Temporal Differences
Jonathan Baxter
,
Andrew Tridgell
,
Lex Weaver
ICML
1998
KnightCap: A Chess Programm That Learns by Combining TD(lambda) with Game-Tree Search
Jonathan Baxter
,
Andrew Tridgell
,
Lex Weaver