Learning Measures of Progress for Planning Domains
Abstract
We study an approach to learning heuristics for planning do-mains from example solutions. There has been little work on learning heuristics for the types of domains used in determin-istic and stochastic planning competitions. Perhaps one rea-son for this is the challenge of providing a compact heuristic language that facilitates learning. Here we introduce a new representation for heuristics based on lists of set expressions described using taxonomic syntax. Next, we review the idea of a measure of progress (Parmar 2002), which is any heuris-tic that is guaranteed to be improvable at every state. We take finding a measure of progress as our learning goal, and describe a simple learning algorithm for this purpose. We evaluate our approach across a range of deterministic and stochastic planning-competition domains. The results show that often greedily following the learned heuristic is highly effective. We also show our heuristic can be combined with learned rule-based policies, producing still stronger results.
Cite
Text
Yoon et al. "Learning Measures of Progress for Planning Domains." AAAI Conference on Artificial Intelligence, 2005.Markdown
[Yoon et al. "Learning Measures of Progress for Planning Domains." AAAI Conference on Artificial Intelligence, 2005.](https://mlanthology.org/aaai/2005/yoon2005aaai-learning/)BibTeX
@inproceedings{yoon2005aaai-learning,
title = {{Learning Measures of Progress for Planning Domains}},
author = {Yoon, Sung Wook and Fern, Alan and Givan, Robert},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2005},
pages = {1217-1222},
url = {https://mlanthology.org/aaai/2005/yoon2005aaai-learning/}
}