Structure and Randomness in Planning and Reinforcement Learning
Abstract
Planning in large state spaces inevitably needs to balance depth and breadth of the search. It has a crucial impact on planners performance and most manage this interplay implicitly. We present a novel method \textit{Shoot Tree Search (STS)}, which makes it possible to control this trade-off more explicitly. Our algorithm can be understood as an interpolation between two celebrated search mechanisms: MCTS and random shooting. It also lets the user control the bias-variance trade-off, akin to $TD(n)$, but in the tree search context. In experiments on challenging domains, we show that STS can get the best of both worlds consistently achieving higher scores.
Cite
Text
Kozakowski et al. "Structure and Randomness in Planning and Reinforcement Learning." NeurIPS 2020 Workshops: LMCA, 2020.Markdown
[Kozakowski et al. "Structure and Randomness in Planning and Reinforcement Learning." NeurIPS 2020 Workshops: LMCA, 2020.](https://mlanthology.org/neuripsw/2020/kozakowski2020neuripsw-structure/)BibTeX
@inproceedings{kozakowski2020neuripsw-structure,
title = {{Structure and Randomness in Planning and Reinforcement Learning}},
author = {Kozakowski, Piotr and Januszewski, Piotr and Czechowski, Konrad and Kuciński, Łukasz and Miłoś, Piotr},
booktitle = {NeurIPS 2020 Workshops: LMCA},
year = {2020},
url = {https://mlanthology.org/neuripsw/2020/kozakowski2020neuripsw-structure/}
}