Autonomous Learning of Action Models for Planning

Abstract

This paper introduces two new frameworks for learning action models for planning. In the mistake-bounded planning framework, the learner has access to a planner for the given model representation, a simulator, and a planning problem generator, and aims to learn a model with at most a polynomial number of faulty plans. In the planned exploration framework, the learner does not have access to a problem generator and must instead design its own problems, plan for them, and converge with at most a polynomial number of planning attempts. The paper reduces learning in these frameworks to concept learning with one-sided error and provides algorithms for successful learning in both frameworks. A specific family of hypothesis spaces is shown to be efficiently learnable in both the frameworks.

Cite

Text

Mehta et al. "Autonomous Learning of Action Models for Planning." Neural Information Processing Systems, 2011.

Markdown

[Mehta et al. "Autonomous Learning of Action Models for Planning." Neural Information Processing Systems, 2011.](https://mlanthology.org/neurips/2011/mehta2011neurips-autonomous/)

BibTeX

@inproceedings{mehta2011neurips-autonomous,
  title     = {{Autonomous Learning of Action Models for Planning}},
  author    = {Mehta, Neville and Tadepalli, Prasad and Fern, Alan},
  booktitle = {Neural Information Processing Systems},
  year      = {2011},
  pages     = {2465-2473},
  url       = {https://mlanthology.org/neurips/2011/mehta2011neurips-autonomous/}
}