Learning Measures of Progress for Planning Domains

Abstract

We study an approach to learning heuristics for planning do-mains from example solutions. There has been little work on learning heuristics for the types of domains used in determin-istic and stochastic planning competitions. Perhaps one rea-son for this is the challenge of providing a compact heuristic language that facilitates learning. Here we introduce a new representation for heuristics based on lists of set expressions described using taxonomic syntax. Next, we review the idea of a measure of progress (Parmar 2002), which is any heuris-tic that is guaranteed to be improvable at every state. We take finding a measure of progress as our learning goal, and describe a simple learning algorithm for this purpose. We evaluate our approach across a range of deterministic and stochastic planning-competition domains. The results show that often greedily following the learned heuristic is highly effective. We also show our heuristic can be combined with learned rule-based policies, producing still stronger results.

Cite

Text

Yoon et al. "Learning Measures of Progress for Planning Domains." AAAI Conference on Artificial Intelligence, 2005.

Markdown

[Yoon et al. "Learning Measures of Progress for Planning Domains." AAAI Conference on Artificial Intelligence, 2005.](https://mlanthology.org/aaai/2005/yoon2005aaai-learning/)

BibTeX

@inproceedings{yoon2005aaai-learning,
  title     = {{Learning Measures of Progress for Planning Domains}},
  author    = {Yoon, Sung Wook and Fern, Alan and Givan, Robert},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2005},
  pages     = {1217-1222},
  url       = {https://mlanthology.org/aaai/2005/yoon2005aaai-learning/}
}