Learning Task Agnostic Skills with Data-Driven Guidance
Abstract
To increase autonomy in reinforcement learning, agents need to learn useful behaviours without reliance on manually designed reward functions. To that end, skill discovery methods have been used to learn the intrinsic options available to an agent using task-agnostic objectives. However, without the guidance of task-specific rewards, emergent behaviours are generally useless due to the under-constrained problem of skill discovery in complex and high-dimensional spaces. This paper proposes a framework for guiding the skill discovery towards the subset of expert-visited states using a learned state projection. We apply our method in various reinforcement learning (RL) tasks and show that such a projection results in more useful behaviours.
Cite
Text
Klemsdal et al. "Learning Task Agnostic Skills with Data-Driven Guidance." ICML 2021 Workshops: URL, 2021.Markdown
[Klemsdal et al. "Learning Task Agnostic Skills with Data-Driven Guidance." ICML 2021 Workshops: URL, 2021.](https://mlanthology.org/icmlw/2021/klemsdal2021icmlw-learning/)BibTeX
@inproceedings{klemsdal2021icmlw-learning,
title = {{Learning Task Agnostic Skills with Data-Driven Guidance}},
author = {Klemsdal, Even and Herland, Sverre and Murad, Abdulmajid},
booktitle = {ICML 2021 Workshops: URL},
year = {2021},
url = {https://mlanthology.org/icmlw/2021/klemsdal2021icmlw-learning/}
}