Learning to Prune Dominated Action Sequences in Online Black-Box Planning
Abstract
Black-box domains where the successor states generated by applying an action are generated by a completely opaque simulator pose a challenge for domain-independent planning. The main computational bottleneck in search-based planning for such domains is the number of calls to the black-box simulation. We propose a method for significantly reducing the number of calls to the simulator by the search algorithm by detecting and pruning sequences of actions which are dominated by others. We apply our pruning method to Iterated Width and breadth-first search in domain-independent black-box planning for Atari 2600 games in the Arcade Learning Environment (ALE), adding our pruning method significantly improves upon the baseline algorithms.
Cite
Text
Jinnai and Fukunaga. "Learning to Prune Dominated Action Sequences in Online Black-Box Planning." AAAI Conference on Artificial Intelligence, 2017. doi:10.1609/AAAI.V31I1.10663Markdown
[Jinnai and Fukunaga. "Learning to Prune Dominated Action Sequences in Online Black-Box Planning." AAAI Conference on Artificial Intelligence, 2017.](https://mlanthology.org/aaai/2017/jinnai2017aaai-learning/) doi:10.1609/AAAI.V31I1.10663BibTeX
@inproceedings{jinnai2017aaai-learning,
title = {{Learning to Prune Dominated Action Sequences in Online Black-Box Planning}},
author = {Jinnai, Yuu and Fukunaga, Alex S.},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2017},
pages = {839-845},
doi = {10.1609/AAAI.V31I1.10663},
url = {https://mlanthology.org/aaai/2017/jinnai2017aaai-learning/}
}