Open Loop Execution of Tree-Search Algorithms
Abstract
In the context of tree-search stochastic planning algorithms where a generative model is available, we consider on-line planning algorithms building trees in order to recommend an action. We investigate the question of avoiding re-planning in subsequent decision steps by directly using sub-trees as action recommender. Firstly, we propose a method for open loop control via a new algorithm taking the decision of re-planning or not at each time step based on an analysis of the statistics of the sub-tree. Secondly, we show that the probability of selecting a suboptimal action at any depth of the tree can be upper bounded and converges towards zero. Moreover, this upper bound decays in a logarithmic way between subsequent depths. This leads to a distinction between node-wise optimality and state-wise optimality. Finally, we empirically demonstrate that our method achieves a compromise between loss of performance and computational gain.
Cite
Text
Lecarpentier et al. "Open Loop Execution of Tree-Search Algorithms." International Joint Conference on Artificial Intelligence, 2018. doi:10.24963/IJCAI.2018/327Markdown
[Lecarpentier et al. "Open Loop Execution of Tree-Search Algorithms." International Joint Conference on Artificial Intelligence, 2018.](https://mlanthology.org/ijcai/2018/lecarpentier2018ijcai-open/) doi:10.24963/IJCAI.2018/327BibTeX
@inproceedings{lecarpentier2018ijcai-open,
title = {{Open Loop Execution of Tree-Search Algorithms}},
author = {Lecarpentier, Erwan and Infantes, Guillaume and Lesire, Charles and Rachelson, Emmanuel},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2018},
pages = {2362-2368},
doi = {10.24963/IJCAI.2018/327},
url = {https://mlanthology.org/ijcai/2018/lecarpentier2018ijcai-open/}
}