TACO: Learning Task Decomposition via Temporal Alignment for Control

Abstract

Many advanced Learning from Demonstration (LfD) methods consider the decomposition of complex, real-world tasks into simpler sub-tasks. By reusing the corresponding sub-policies within and between tasks, we can provide training data for each policy from different high-level tasks and compose them to perform novel ones. Existing approaches to modular LfD focus either on learning a single high-level task or depend on domain knowledge and temporal segmentation. In contrast, we propose a weakly supervised, domain-agnostic approach based on task sketches, which include only the sequence of sub-tasks performed in each demonstration. Our approach simultaneously aligns the sketches with the observed demonstrations and learns the required sub-policies. This improves generalisation in comparison to separate optimisation procedures. We evaluate the approach on multiple domains, including a simulated 3D robot arm control task using purely image-based observations. The results show that our approach performs commensurately with fully supervised approaches, while requiring significantly less annotation effort.

Cite

Text

Shiarlis et al. "TACO: Learning Task Decomposition via Temporal Alignment for Control." International Conference on Machine Learning, 2018.

Markdown

[Shiarlis et al. "TACO: Learning Task Decomposition via Temporal Alignment for Control." International Conference on Machine Learning, 2018.](https://mlanthology.org/icml/2018/shiarlis2018icml-taco/)

BibTeX

@inproceedings{shiarlis2018icml-taco,
  title     = {{TACO: Learning Task Decomposition via Temporal Alignment for Control}},
  author    = {Shiarlis, Kyriacos and Wulfmeier, Markus and Salter, Sasha and Whiteson, Shimon and Posner, Ingmar},
  booktitle = {International Conference on Machine Learning},
  year      = {2018},
  pages     = {4654-4663},
  volume    = {80},
  url       = {https://mlanthology.org/icml/2018/shiarlis2018icml-taco/}
}