Meta-Learning Parameterized Skills

Abstract

We propose a novel parameterized skill-learning algorithm that aims to learn transferable parameterized skills and synthesize them into a new action space that supports efficient learning in long-horizon tasks. We propose to leverage off-policy Meta-RL combined with a trajectory-centric smoothness term to learn a set of parameterized skills. Our agent can use these learned skills to construct a three-level hierarchical framework that models a Temporally-extended Parameterized Action Markov Decision Process. We empirically demonstrate that the proposed algorithms enable an agent to solve a set of highly difficult long-horizon (obstacle-course and robot manipulation) tasks.

Cite

Text

Fu et al. "Meta-Learning Parameterized Skills." International Conference on Machine Learning, 2023.

Markdown

[Fu et al. "Meta-Learning Parameterized Skills." International Conference on Machine Learning, 2023.](https://mlanthology.org/icml/2023/fu2023icml-metalearning/)

BibTeX

@inproceedings{fu2023icml-metalearning,
  title     = {{Meta-Learning Parameterized Skills}},
  author    = {Fu, Haotian and Yu, Shangqun and Tiwari, Saket and Littman, Michael and Konidaris, George},
  booktitle = {International Conference on Machine Learning},
  year      = {2023},
  pages     = {10461-10481},
  volume    = {202},
  url       = {https://mlanthology.org/icml/2023/fu2023icml-metalearning/}
}