Visual Goal-Directed Meta-Imitation Learning

Abstract

The goal of meta-learning is to generalize to new tasks and goals as quickly as possible. Ideally, we would like approaches that generalize to new goals and tasks on the first attempt. Requiring a policy to perform on a new task on the first attempt without even a single example trajectory is a zero-shot problem formulation. When tasks are identified by goal images, the tasks can be considered visually goal-directed. In this work, we explore the problem of visual goal-directed zero-shot meta-imitation learning. Inspired by several popular approaches to Meta-RL, we composed several core ideas related to task-embedding and planning by gradient descent to attempt to explore this problem. To evaluate these approaches, we adapted the Meta-world benchmark tasks to create 24 distinct visual goal-directed manipulation tasks. We found that 7 out of 24 tasks could be successfully completed on the first attempt by at least one of the approaches we tested. We demonstrated that goal-directed zero-shot approaches can translate to a physical robot with a demonstration based on Jenga block manipulation tasks using a Kinova Jaco robotic arm.

Cite

Text

Rivera et al. "Visual Goal-Directed Meta-Imitation Learning." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022. doi:10.1109/CVPRW56347.2022.00421

Markdown

[Rivera et al. "Visual Goal-Directed Meta-Imitation Learning." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022.](https://mlanthology.org/cvprw/2022/rivera2022cvprw-visual/) doi:10.1109/CVPRW56347.2022.00421

BibTeX

@inproceedings{rivera2022cvprw-visual,
  title     = {{Visual Goal-Directed Meta-Imitation Learning}},
  author    = {Rivera, Corban G. and Handelman, David A. and Ratto, Christopher R. and Patrone, David and Paulhamus, Bart L.},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2022},
  pages     = {3766-3772},
  doi       = {10.1109/CVPRW56347.2022.00421},
  url       = {https://mlanthology.org/cvprw/2022/rivera2022cvprw-visual/}
}