RL-TOPS: An Architecture for Modularity and Re-Use in Reinforcement Learning

Abstract

This paper introduces the RL-TOPs architecture for robot learning, a hybrid system combining teleo-reactive planning and reinforcement learning techniques. The aim of this system is to speed up learning by decomposing complex tasks into hierarchies of simple behaviours which can be learnt more easily. Behaviours learnt in this way can subsequently be re-used to solve a variety of problems, reducing the need to learn every new task from scratch. It is even possible to learn multiple behaviours simultaneously, thus making more efficient use of experience. We demonstrate these advantages in a simple simulated environment.

Cite

Text

Ryan and Pendrith. "RL-TOPS: An Architecture for Modularity and Re-Use in Reinforcement Learning." International Conference on Machine Learning, 1998.

Markdown

[Ryan and Pendrith. "RL-TOPS: An Architecture for Modularity and Re-Use in Reinforcement Learning." International Conference on Machine Learning, 1998.](https://mlanthology.org/icml/1998/ryan1998icml-rl/)

BibTeX

@inproceedings{ryan1998icml-rl,
  title     = {{RL-TOPS: An Architecture for Modularity and Re-Use in Reinforcement Learning}},
  author    = {Ryan, Malcolm R. K. and Pendrith, Mark D.},
  booktitle = {International Conference on Machine Learning},
  year      = {1998},
  pages     = {481-487},
  url       = {https://mlanthology.org/icml/1998/ryan1998icml-rl/}
}