Simulating Human Gaze with Neural Visual Attention

Abstract

Existing models of human visual attention are generally unable to incorporate direct task guidance and therefore cannot model an intent or goal when exploring a scene. To integrate guidance of any downstream visual task into attention modeling, we propose the Neural Visual Attention (NeVA) algorithm. To this end, we impose to neural networks the biological constraint of foveated vision and train an attention mechanism to generate visual explorations that maximize the performance with respect to the downstream task. We observe that biologically constrained neural networks generate human-like scanpaths without being trained for this objective. Extensive experiments on three common benchmark datasets show that our method outperforms state-of-the-art unsupervised human attention models in generating human-like scanpaths. Full paper available at TMLR: https://openreview.net/forum?id=7iSYW1FRWA.

Cite

Text

Schwinn et al. "Simulating Human Gaze with Neural Visual Attention." NeurIPS 2022 Workshops: GMML, 2022.

Markdown

[Schwinn et al. "Simulating Human Gaze with Neural Visual Attention." NeurIPS 2022 Workshops: GMML, 2022.](https://mlanthology.org/neuripsw/2022/schwinn2022neuripsw-simulating/)

BibTeX

@inproceedings{schwinn2022neuripsw-simulating,
  title     = {{Simulating Human Gaze with Neural Visual Attention}},
  author    = {Schwinn, Leo and Precup, Doina and Eskofier, Bjoern and Zanca, Dario},
  booktitle = {NeurIPS 2022 Workshops: GMML},
  year      = {2022},
  url       = {https://mlanthology.org/neuripsw/2022/schwinn2022neuripsw-simulating/}
}