Transferring Hierarchical Structures with Dual Meta Imitation Learning

Abstract

Hierarchical Imitation Learning (HIL) is an effective way for robots to learn sub-skills from long-horizon unsegmented demonstrations. However, the learned hierarchical structure lacks the mechanism to transfer across multi-tasks or to new tasks, which makes them have to learn from scratch when facing a new situation. Transferring and reorganizing modular sub-skills require fast adaptation ability of the whole hierarchical structure. In this work, we propose Dual Meta Imitation Learning (DMIL), a hierarchical meta imitation learning method where the high-level network and sub-skills are iteratively meta-learned with model-agnostic meta-learning. DMIL uses the likelihood of state-action pairs from each sub-skill as the supervision for the high-level network adaptation and uses the adapted high-level network to determine different data set for each sub-skill adaptation. We theoretically prove the convergence of the iterative training process of DMIL and establish the connection between DMIL and Expectation-Maximization algorithm. Empirically, we achieve state-of-the-art few-shot imitation learning performance on the Meta-world benchmark and competitive results on long-horizon tasks in Kitchen environments.

Cite

Text

Gao et al. "Transferring Hierarchical Structures with Dual Meta Imitation Learning." Conference on Robot Learning, 2022.

Markdown

[Gao et al. "Transferring Hierarchical Structures with Dual Meta Imitation Learning." Conference on Robot Learning, 2022.](https://mlanthology.org/corl/2022/gao2022corl-transferring/)

BibTeX

@inproceedings{gao2022corl-transferring,
  title     = {{Transferring Hierarchical Structures with Dual Meta Imitation Learning}},
  author    = {Gao, Chongkai and Jiang, Yizhou and Chen, Feng},
  booktitle = {Conference on Robot Learning},
  year      = {2022},
  pages     = {762-773},
  volume    = {205},
  url       = {https://mlanthology.org/corl/2022/gao2022corl-transferring/}
}