Sub-Task Discovery with Limited Supervision: A Constrained Clustering Approach

Abstract

Hierarchical reinforcement learning captures sub-task information to learn modular policies that can be quickly adapted to new tasks. While hierarchies can be learned jointly with policies, doing so requires a large amount of environment interaction. Traditional approaches need less data but typically rely on sub-task labels to build a task hierarchy. We propose a semi-supervised constrained clustering approach that reduces both the labeling and the interaction requirements. Our approach combines limited supervision with an arbitrary set of weak constraints, obtained purely from observations, which are jointly optimized to produce a clustering of the states into sub-tasks. We demonstrate improvement on two visual reinforcement learning tasks.
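
To make the idea of clustering under limited supervision and weak constraints concrete, the following is a minimal sketch of a generic penalized constrained k-means. It is not the authors' method: the abstract does not specify the form of the constraints or the objective, so pairwise must-link and cannot-link constraints, the `weight` penalty, and the function name `constrained_kmeans` are all assumptions made for illustration. A handful of labeled states act as fixed seeds, and every violated constraint adds a cost to an assignment.

```python
import numpy as np


def constrained_kmeans(X, k, seeds, must_link, cannot_link, weight=1.0, n_iter=20):
    """Illustrative penalized k-means (hypothetical helper, not from the paper).

    seeds:       {point index: cluster id}, the limited supervision (kept fixed)
    must_link:   (i, j) pairs assumed to belong to the same sub-task
    cannot_link: (i, j) pairs assumed to belong to different sub-tasks
    Assumes every cluster has at least one seed.
    """
    X = np.asarray(X, dtype=float)
    n = len(X)
    labels = np.full(n, -1)  # -1 marks "not assigned yet"
    for i, c in seeds.items():
        labels[i] = c

    # Initialise each centroid as the mean of its seeded points.
    centroids = np.stack(
        [X[[i for i, c in seeds.items() if c == j]].mean(axis=0) for j in range(k)]
    )

    # Index constraints by point for quick lookup.
    ml = {i: [] for i in range(n)}
    cl = {i: [] for i in range(n)}
    for a, b in must_link:
        ml[a].append(b)
        ml[b].append(a)
    for a, b in cannot_link:
        cl[a].append(b)
        cl[b].append(a)

    for _ in range(n_iter):
        changed = False
        for i in range(n):
            if i in seeds:
                continue  # supervised points never move
            cost = ((X[i] - centroids) ** 2).sum(axis=1)
            for j in ml[i]:
                if labels[j] >= 0:
                    # Penalise every cluster except the partner's cluster.
                    cost += weight
                    cost[labels[j]] -= weight
            for j in cl[i]:
                if labels[j] >= 0:
                    # Penalise joining the partner's cluster.
                    cost[labels[j]] += weight
            new = int(np.argmin(cost))
            if new != labels[i]:
                labels[i] = new
                changed = True
        # Recompute centroids; seeded points keep every cluster non-empty.
        for c in range(k):
            centroids[c] = X[labels == c].mean(axis=0)
        if not changed:
            break
    return labels


# Toy data: two groups plus one ambiguous point (index 6) closer to group 1.
X = [[0, 0], [0, 1], [1, 0], [10, 10], [10, 11], [11, 10], [6, 6]]
seeds = {0: 0, 3: 1}
with_constraints = constrained_kmeans(X, 2, seeds, [(6, 1)], [(6, 4)], weight=100.0)
without_constraints = constrained_kmeans(X, 2, seeds, [], [], weight=0.0)
```

In this toy setting the ambiguous point follows its must-link partner into cluster 0 when the constraints are active, and falls into the nearer cluster 1 when they are dropped. Treating constraints as soft penalties rather than hard rules is one possible design choice; it tolerates noisy constraints, which seems plausible for constraints derived from observations alone.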

Cite

Text

Odom et al. "Sub-Task Discovery with Limited Supervision: A Constrained Clustering Approach." ICLR 2019 Workshops: LLD, 2019.

Markdown

[Odom et al. "Sub-Task Discovery with Limited Supervision: A Constrained Clustering Approach." ICLR 2019 Workshops: LLD, 2019.](https://mlanthology.org/iclrw/2019/odom2019iclrw-subtask/)

BibTeX

@inproceedings{odom2019iclrw-subtask,
  title     = {{Sub-Task Discovery with Limited Supervision: A Constrained Clustering Approach}},
  author    = {Odom, Phillip and Keech, Aaron and Kira, Zsolt},
  booktitle = {ICLR 2019 Workshops: LLD},
  year      = {2019},
  url       = {https://mlanthology.org/iclrw/2019/odom2019iclrw-subtask/}
}