Recurrent Independent Mechanisms
Abstract
We explore the hypothesis that learning modular structures which reflect the dynamics of the environment can lead to better generalization and robustness to changes that only affect a few of the underlying causes. We propose Recurrent Independent Mechanisms (RIMs), a new recurrent architecture in which multiple groups of recurrent cells operate with nearly independent transition dynamics, communicate only sparingly through the bottleneck of attention, and compete with each other so they are updated only at time steps where they are most relevant. We show that this leads to specialization amongst the RIMs, which in turn allows for remarkably improved generalization on tasks where some factors of variation differ systematically between training and evaluation.
Cite
Text
Goyal et al. "Recurrent Independent Mechanisms." International Conference on Learning Representations, 2021.Markdown
[Goyal et al. "Recurrent Independent Mechanisms." International Conference on Learning Representations, 2021.](https://mlanthology.org/iclr/2021/goyal2021iclr-recurrent/)BibTeX
@inproceedings{goyal2021iclr-recurrent,
title = {{Recurrent Independent Mechanisms}},
author = {Goyal, Anirudh and Lamb, Alex and Hoffmann, Jordan and Sodhani, Shagun and Levine, Sergey and Bengio, Yoshua and Schölkopf, Bernhard},
booktitle = {International Conference on Learning Representations},
year = {2021},
url = {https://mlanthology.org/iclr/2021/goyal2021iclr-recurrent/}
}