Demonstration-Guided Reinforcement Learning with Learned Skills

Abstract

Demonstration-guided reinforcement learning (RL) is a promising approach for learning complex behaviors by leveraging both reward feedback and a set of target task demonstrations. Prior approaches for demonstration-guided RL treat every new task as an independent learning problem and attempt to follow the provided demonstrations step-by-step, akin to a human trying to imitate a completely unseen behavior by following the demonstrator’s exact muscle movements. Naturally, such learning will be slow, but often new behaviors are not completely unseen: they share subtasks with behaviors we have previously learned. In this work, we aim to exploit this shared subtask structure to increase the efficiency of demonstration-guided RL. We first learn a set of reusable skills from large offline datasets of prior experience collected across many tasks. We then propose Skill-based Learning with Demonstrations (SkiLD), an algorithm for demonstration-guided RL that efficiently leverages the provided demonstrations by following the demonstrated skills instead of the primitive actions, resulting in substantial performance improvements over prior demonstration-guided RL approaches. We validate the effectiveness of our approach on long-horizon maze navigation and complex robot manipulation tasks.

Cite

Text

Pertsch et al. "Demonstration-Guided Reinforcement Learning with Learned Skills." Conference on Robot Learning, 2021.

Markdown

[Pertsch et al. "Demonstration-Guided Reinforcement Learning with Learned Skills." Conference on Robot Learning, 2021.](https://mlanthology.org/corl/2021/pertsch2021corl-demonstrationguided/)

BibTeX

@inproceedings{pertsch2021corl-demonstrationguided,
  title     = {{Demonstration-Guided Reinforcement Learning with Learned Skills}},
  author    = {Pertsch, Karl and Lee, Youngwoon and Wu, Yue and Lim, Joseph J},
  booktitle = {Conference on Robot Learning},
  year      = {2021},
  pages     = {729-739},
  volume    = {164},
  url       = {https://mlanthology.org/corl/2021/pertsch2021corl-demonstrationguided/}
}