Comparing Intuitions About Agents’ Goals, Preferences and Actions in Human Infants and Video Transformers
Abstract
Although AI has made large strides in recent years, state-of-the-art models still largely lack core components of social cognition which emerge early on in infant development. The Baby Intuitions Benchmark was explicitly designed to compare these "commonsense psychology" abilities in humans and machines. Recurrent neural network-based models previously applied to this dataset have been shown to not capture the desired knowledge. We here apply a different class of deep learning-based model, namely a video transformer, and show that it quantitatively more closely matches infant intuitions. However, qualitative error analyses show that model is prone to exploiting particularities of the training data for its decisions.
Cite
Text
Hein and Diepold. "Comparing Intuitions About Agents’ Goals, Preferences and Actions in Human Infants and Video Transformers." NeurIPS 2022 Workshops: SVRHM, 2022.Markdown
[Hein and Diepold. "Comparing Intuitions About Agents’ Goals, Preferences and Actions in Human Infants and Video Transformers." NeurIPS 2022 Workshops: SVRHM, 2022.](https://mlanthology.org/neuripsw/2022/hein2022neuripsw-comparing/)BibTeX
@inproceedings{hein2022neuripsw-comparing,
title = {{Comparing Intuitions About Agents’ Goals, Preferences and Actions in Human Infants and Video Transformers}},
author = {Hein, Alice and Diepold, Klaus},
booktitle = {NeurIPS 2022 Workshops: SVRHM},
year = {2022},
url = {https://mlanthology.org/neuripsw/2022/hein2022neuripsw-comparing/}
}