Meta-Learning Parameterized Skills
Abstract
We propose a novel parameterized skill-learning algorithm that aims to learn transferable parameterized skills and synthesize them into a new action space that supports efficient learning in long-horizon tasks. We propose to leverage off-policy Meta-RL combined with a trajectory-centric smoothness term to learn a set of parameterized skills. Our agent can use these learned skills to construct a three-level hierarchical framework that models a Temporally-extended Parameterized Action Markov Decision Process. We empirically demonstrate that the proposed algorithms enable an agent to solve a set of highly difficult long-horizon (obstacle-course and robot manipulation) tasks.
Cite
Text
Fu et al. "Meta-Learning Parameterized Skills." International Conference on Machine Learning, 2023.Markdown
[Fu et al. "Meta-Learning Parameterized Skills." International Conference on Machine Learning, 2023.](https://mlanthology.org/icml/2023/fu2023icml-metalearning/)BibTeX
@inproceedings{fu2023icml-metalearning,
title = {{Meta-Learning Parameterized Skills}},
author = {Fu, Haotian and Yu, Shangqun and Tiwari, Saket and Littman, Michael and Konidaris, George},
booktitle = {International Conference on Machine Learning},
year = {2023},
pages = {10461-10481},
volume = {202},
url = {https://mlanthology.org/icml/2023/fu2023icml-metalearning/}
}