A Simple Neural Network Module for Relational Reasoning
Abstract
Relational reasoning is a central component of generally intelligent behavior, but has proven difficult for neural networks to learn. In this paper we describe how to use Relation Networks (RNs) as a simple plug-and-play module to solve problems that fundamentally hinge on relational reasoning. We tested RN-augmented networks on three tasks: visual question answering using a challenging dataset called CLEVR, on which we achieve state-of-the-art, super-human performance; text-based question answering using the bAbI suite of tasks; and complex reasoning about dynamical physical systems. Then, using a curated dataset called Sort-of-CLEVR we show that powerful convolutional networks do not have a general capacity to solve relational questions, but can gain this capacity when augmented with RNs. Thus, by simply augmenting convolutions, LSTMs, and MLPs with RNs, we can remove computational burden from network components that are not well-suited to handle relational reasoning, reduce overall network complexity, and gain a general ability to reason about the relations between entities and their properties.
Cite
Text
Santoro et al. "A Simple Neural Network Module for Relational Reasoning." Neural Information Processing Systems, 2017.Markdown
[Santoro et al. "A Simple Neural Network Module for Relational Reasoning." Neural Information Processing Systems, 2017.](https://mlanthology.org/neurips/2017/santoro2017neurips-simple/)BibTeX
@inproceedings{santoro2017neurips-simple,
title = {{A Simple Neural Network Module for Relational Reasoning}},
author = {Santoro, Adam and Raposo, David and Barrett, David G and Malinowski, Mateusz and Pascanu, Razvan and Battaglia, Peter and Lillicrap, Timothy},
booktitle = {Neural Information Processing Systems},
year = {2017},
pages = {4967-4976},
url = {https://mlanthology.org/neurips/2017/santoro2017neurips-simple/}
}