To Max or Not to Max: Online Learning for Speeding up Optimal Planning
Abstract
It is well known that there cannot be a single "best" heuristic for optimal planning in general. One way of overcoming this is by combining admissible heuristics (e.g. by using their maximum), which requires computing numerous heuristic estimates at each state. However, there is a tradeoff between the time spent on computing these heuristic estimates for each state, and the time saved by reducing the number of expanded states. We present a novel method that reduces the cost of combining admissible heuristics for optimal search, while maintaining its benefits. Based on an idealized search space model, we formulate a decision rule for choosing the best heuristic to compute at each state. We then present an active online learning approach for that decision rule, and employ the learned model to decide which heuristic to compute at each state. We evaluate this technique empirically, and show that it substantially outperforms each of the individual heuristics that were used, as well as their regular maximum.
Cite
Text
Domshlak et al. "To Max or Not to Max: Online Learning for Speeding up Optimal Planning." AAAI Conference on Artificial Intelligence, 2010. doi:10.1609/AAAI.V24I1.7741Markdown
[Domshlak et al. "To Max or Not to Max: Online Learning for Speeding up Optimal Planning." AAAI Conference on Artificial Intelligence, 2010.](https://mlanthology.org/aaai/2010/domshlak2010aaai-max/) doi:10.1609/AAAI.V24I1.7741BibTeX
@inproceedings{domshlak2010aaai-max,
title = {{To Max or Not to Max: Online Learning for Speeding up Optimal Planning}},
author = {Domshlak, Carmel and Karpas, Erez and Markovitch, Shaul},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2010},
pages = {1071-1076},
doi = {10.1609/AAAI.V24I1.7741},
url = {https://mlanthology.org/aaai/2010/domshlak2010aaai-max/}
}