Reducing the Planning Horizon Through Reinforcement Learning

Dunbar, Logan; Rosman, Benjamin; Cohn, Anthony G.; Leonetti, Matteo

doi:10.1007/978-3-031-26412-2_5

Reducing the Planning Horizon Through Reinforcement Learning

Logan Dunbar, Benjamin Rosman, Anthony G. Cohn, Matteo Leonetti

ECML-PKDD 2022 pp. 68-83

doi:10.1007/978-3-031-26412-2_5 /ecmlpkdd/2022/dunbar2022ecmlpkdd-reducing/

Abstract

Planning is a computationally expensive process, which can limit the reactivity of autonomous agents. Planning problems are usually solved in isolation, independently of similar, previously solved problems. The depth of search that a planner requires to find a solution, known as the planning horizon, is a critical factor when integrating planners into reactive agents. We consider the case of an agent repeatedly carrying out a task from different initial states. We propose a combination of classical planning and model-free reinforcement learning to reduce the planning horizon over time. Control is smoothly transferred from the planner to the model-free policy as the agent compiles the planner’s policy into a value function. Local exploration of the model-free policy allows the agent to adapt to the environment and eventually overcome model inaccuracies. We evaluate the efficacy of our framework on symbolic PDDL domains and a stochastic grid world environment and show that we are able to significantly reduce the planning horizon while improving upon model inaccuracies.

PDF ECML-PKDD Semantic Scholar

Cite

Text

Dunbar et al. "Reducing the Planning Horizon Through Reinforcement Learning." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2022. doi:10.1007/978-3-031-26412-2_5

Markdown

[Dunbar et al. "Reducing the Planning Horizon Through Reinforcement Learning." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2022.](https://mlanthology.org/ecmlpkdd/2022/dunbar2022ecmlpkdd-reducing/) doi:10.1007/978-3-031-26412-2_5

BibTeX

@inproceedings{dunbar2022ecmlpkdd-reducing,
  title     = {{Reducing the Planning Horizon Through Reinforcement Learning}},
  author    = {Dunbar, Logan and Rosman, Benjamin and Cohn, Anthony G. and Leonetti, Matteo},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2022},
  pages     = {68-83},
  doi       = {10.1007/978-3-031-26412-2_5},
  url       = {https://mlanthology.org/ecmlpkdd/2022/dunbar2022ecmlpkdd-reducing/}
}