Machine Theory of Mind

Abstract

Theory of mind (ToM) broadly refers to humans’ ability to represent the mental states of others, including their desires, beliefs, and intentions. We design a Theory of Mind neural network – a ToMnet – which uses meta-learning to build such models of the agents it encounters. The ToMnet learns a strong prior model for agents’ future behaviour, and, using only a small number of behavioural observations, can bootstrap to richer predictions about agents’ characteristics and mental states. We apply the ToMnet to agents behaving in simple gridworld environments, showing that it learns to model random, algorithmic, and deep RL agents from varied populations, and that it passes classic ToM tasks such as the "Sally-Anne" test of recognising that others can hold false beliefs about the world.

Cite

Text

Rabinowitz et al. "Machine Theory of Mind." International Conference on Machine Learning, 2018.

Markdown

[Rabinowitz et al. "Machine Theory of Mind." International Conference on Machine Learning, 2018.](https://mlanthology.org/icml/2018/rabinowitz2018icml-machine/)

BibTeX

@inproceedings{rabinowitz2018icml-machine,
  title     = {{Machine Theory of Mind}},
  author    = {Rabinowitz, Neil and Perbet, Frank and Song, Francis and Zhang, Chiyuan and Eslami, S. M. Ali and Botvinick, Matthew},
  booktitle = {International Conference on Machine Learning},
  year      = {2018},
  pages     = {4218-4227},
  volume    = {80},
  url       = {https://mlanthology.org/icml/2018/rabinowitz2018icml-machine/}
}