Fast and Slow Learning of Recurrent Independent Mechanisms
Abstract
Decomposing knowledge into interchangeable pieces promises a generalization advantage when there are changes in distribution. A learning agent interacting with its environment is likely to be faced with situations requiring novel combinations of existing pieces of knowledge. We hypothesize that such a decomposition of knowledge is particularly relevant for being able to generalize in a systematic way to out-of-distribution changes. To study these ideas, we propose a particular training framework in which we assume that the pieces of knowledge an agent needs and its reward function are stationary and can be re-used across tasks. An attention mechanism dynamically selects which modules can be adapted to the current task, and the parameters of the \textit{selected} modules are allowed to change quickly as the learner is confronted with variations in what it experiences, while the parameters of the attention mechanisms act as stable, slowly changing, meta-parameters. We focus on pieces of knowledge captured by an ensemble of modules sparsely communicating with each other via a bottleneck of attention. We find that meta-learning the modular aspects of the proposed system greatly helps in achieving faster adaptation in a reinforcement learning setup involving navigation in a partially observed grid world with image-level input. We also find that reversing the role of parameters and meta-parameters does not work nearly as well, suggesting a particular role for fast adaptation of the dynamically selected modules.
Cite
Text
Madan et al. "Fast and Slow Learning of Recurrent Independent Mechanisms." International Conference on Learning Representations, 2021.Markdown
[Madan et al. "Fast and Slow Learning of Recurrent Independent Mechanisms." International Conference on Learning Representations, 2021.](https://mlanthology.org/iclr/2021/madan2021iclr-fast/)BibTeX
@inproceedings{madan2021iclr-fast,
title = {{Fast and Slow Learning of Recurrent Independent Mechanisms}},
author = {Madan, Kanika and Ke, Nan Rosemary and Goyal, Anirudh and Schölkopf, Bernhard and Bengio, Yoshua},
booktitle = {International Conference on Learning Representations},
year = {2021},
url = {https://mlanthology.org/iclr/2021/madan2021iclr-fast/}
}