Example-Driven Model-Based Reinforcement Learning for Solving Long-Horizon Visuomotor Tasks

Abstract

In this paper, we study the problem of learning a repertoire of low-level skills from raw images that can be sequenced to complete long-horizon visuomotor tasks. Reinforcement learning (RL) is a promising approach for acquiring short-horizon skills autonomously. However, the focus of RL algorithms has largely been on the success of those individual skills, more so than learning and grounding a large repertoire of skills that can be sequenced to complete extended multi-stage tasks. The latter demands robustness and persistence, as errors in skills can compound over time, and may require the robot to have a number of primitive skills in its repertoire, rather than just one. To this end, we introduce EMBR, a model-based RL method for learning primitive skills that are suitable for completing long-horizon visuomotor tasks. EMBR learns and plans using a learned model, critic, and success classifier, where the success classifier serves both as a reward function for RL and as a grounding mechanism to continuously detect if the robot should retry a skill when unsuccessful or under perturbations. Further, the learned model is task-agnostic and trained using data from all skills, enabling the robot to efficiently learn a number of distinct primitives. These visuomotor primitive skills and their associated pre- and post-conditions can then be directly combined with off-the-shelf symbolic planners to complete long-horizon tasks. On a Franka Emika robot arm, we find that EMBR enables the robot to complete three long-horizon visuomotor tasks at 85% success rate, such as organizing an office desk, a file cabinet, and drawers, which require sequencing up to 12 skills, involve 14 unique learned primitives, and demand generalization to novel objects.

Cite

Text

Wu et al. "Example-Driven Model-Based Reinforcement Learning for Solving Long-Horizon Visuomotor Tasks." Conference on Robot Learning, 2021.

Markdown

[Wu et al. "Example-Driven Model-Based Reinforcement Learning for Solving Long-Horizon Visuomotor Tasks." Conference on Robot Learning, 2021.](https://mlanthology.org/corl/2021/wu2021corl-exampledriven/)

BibTeX

@inproceedings{wu2021corl-exampledriven,
  title     = {{Example-Driven Model-Based Reinforcement Learning for Solving Long-Horizon Visuomotor Tasks}},
  author    = {Wu, Bohan and Nair, Suraj and Fei-Fei, Li and Finn, Chelsea},
  booktitle = {Conference on Robot Learning},
  year      = {2021},
  pages     = {1-13},
  volume    = {164},
  url       = {https://mlanthology.org/corl/2021/wu2021corl-exampledriven/}
}