An Object-Based Bayesian Framework for Top-Down Visual Attention

Abstract

We introduce a new task-independent framework to model top-down overt visual attention based on graph-ical models for probabilistic inference and reasoning. We describe a Dynamic Bayesian Network (DBN) that infers probability distributions over attended objects and spatial locations directly from observed data. Probabilistic inference in our model is performed over object-related functions which are fed from manual annotations of objects in video scenes or by state-of-the-art object detection models. Evaluating over ∼3 hours (appx. 315,000 eye fixations and 12,600 saccades) of observers playing 3 video games (time-scheduling, driving, and flight combat), we show that our approach is significantly more predictive of eye fixations compared to: 1) simpler classifier-based models also developed here that map a signature of a scene (multi-modal information from gist, bottom-up saliency, physical actions, and events) to eye positions, 2) 14 state-of-the-art bottom-up saliency models, and 3) brute-force algorithms such as mean eye position. Our results show that the proposed model is more effective in employing and reasoning over spatio-temporal visual data.

Cite

Text

Borji et al. "An Object-Based Bayesian Framework for Top-Down Visual Attention." AAAI Conference on Artificial Intelligence, 2012. doi:10.1609/AAAI.V26I1.8334

Markdown

[Borji et al. "An Object-Based Bayesian Framework for Top-Down Visual Attention." AAAI Conference on Artificial Intelligence, 2012.](https://mlanthology.org/aaai/2012/borji2012aaai-object/) doi:10.1609/AAAI.V26I1.8334

BibTeX

@inproceedings{borji2012aaai-object,
  title     = {{An Object-Based Bayesian Framework for Top-Down Visual Attention}},
  author    = {Borji, Ali and Sihite, Dicky N. and Itti, Laurent},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2012},
  pages     = {1529-1535},
  doi       = {10.1609/AAAI.V26I1.8334},
  url       = {https://mlanthology.org/aaai/2012/borji2012aaai-object/}
}