Introducing Symmetries to Black Box Meta Reinforcement Learning
Abstract
Meta reinforcement learning (RL) attempts to discover new RL algorithms automatically from environment interaction. In so-called black-box approaches, the policy and the learning algorithm are jointly represented by a single neural network. These methods are very flexible, but they tend to underperform compared to human-engineered RL algorithms in terms of generalisation to new, unseen environments. In this paper, we explore the role of symmetries in meta-generalisation. We show that a recent successful meta RL approach that meta-learns an objective for backpropagation-based learning exhibits certain symmetries (specifically the reuse of the learning rule, and invariance to input and output permutations) that are not present in typical black-box meta RL systems. We hypothesise that these symmetries can play an important role in meta-generalisation. Building off recent work in black-box supervised meta learning, we develop a black-box meta RL system that exhibits these same symmetries. We show through careful experimentation that incorporating these symmetries can lead to algorithms with a greater ability to generalise to unseen action & observation spaces, tasks, and environments.
Cite
Text
Kirsch et al. "Introducing Symmetries to Black Box Meta Reinforcement Learning." AAAI Conference on Artificial Intelligence, 2022. doi:10.1609/AAAI.V36I7.20681Markdown
[Kirsch et al. "Introducing Symmetries to Black Box Meta Reinforcement Learning." AAAI Conference on Artificial Intelligence, 2022.](https://mlanthology.org/aaai/2022/kirsch2022aaai-introducing/) doi:10.1609/AAAI.V36I7.20681BibTeX
@inproceedings{kirsch2022aaai-introducing,
title = {{Introducing Symmetries to Black Box Meta Reinforcement Learning}},
author = {Kirsch, Louis and Flennerhag, Sebastian and van Hasselt, Hado and Friesen, Abram L. and Oh, Junhyuk and Chen, Yutian},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2022},
pages = {7202-7210},
doi = {10.1609/AAAI.V36I7.20681},
url = {https://mlanthology.org/aaai/2022/kirsch2022aaai-introducing/}
}