Learning Subgoal Sequences for Planning

Abstract

A learning problem solver consists of three components: (1) a problem solver, (2) a memory of problem-solving knowledge, and (3) a learning component for deriving new problemsolving knowledge from experience. Previous learning problem solvers have acquired knowledge as macros, control knowledge, or cases. SteppingStone is a general learning problem solver that improves its performance by learning subgoal sequences. The underlying problem solver for SteppingStone is a combination of means-ends analysis and brute-force search. It depends primarily upon means-ends analysis and its problem-solving knowledge to solve the subgoals of the problem. Learning occurs only when this approach fails. Upon failure, search is applied and a new subgoal sequence is derived and added to its problem-solving knowledge. Before SteppingStone attempts to solve any of the problem subgoals, it first orders them with a domain independent heuristic which we call openness. Openness is used to order the subgoals to minimize interactions. Stepping-Stone's ability to improve its performance and scale to difficult problems is demonstrated with an implemented system. We show that a small memory of appropriate subgoals yields multiple orders of magnitude savings in problem-solving cost.

Cite

Text

Ruby and Kibler. "Learning Subgoal Sequences for Planning." International Joint Conference on Artificial Intelligence, 1989.

Markdown

[Ruby and Kibler. "Learning Subgoal Sequences for Planning." International Joint Conference on Artificial Intelligence, 1989.](https://mlanthology.org/ijcai/1989/ruby1989ijcai-learning/)

BibTeX

@inproceedings{ruby1989ijcai-learning,
  title     = {{Learning Subgoal Sequences for Planning}},
  author    = {Ruby, David and Kibler, Dennis F.},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {1989},
  pages     = {609-614},
  url       = {https://mlanthology.org/ijcai/1989/ruby1989ijcai-learning/}
}