Video Saliency Detection via Dynamic Consistent Spatio-Temporal Attention Modelling

Abstract

Human vision system actively seeks salient regions and movements in video sequences to reduce the search effort. Modeling computational visual saliency map provides im-portant information for semantic understanding in many real world applications. In this paper, we propose a novel video saliency detection model for detecting the attended regions that correspond to both interesting objects and dominant motions in video sequences. In spatial saliency map, we in-herit the classical bottom-up spatial saliency map. In tem-poral saliency map, a novel optical flow model is proposed based on the dynamic consistency of motion. The spatial and the temporal saliency maps are constructed and further fused together to create a novel attention model. The pro-posed attention model is evaluated on three video datasets. Empirical validations demonstrate the salient regions de-tected by our dynamic consistent saliency map highlight the interesting objects effectively and efficiency. More im-portantly, the automatically video attended regions detected by proposed attention model are consistent with the ground truth saliency maps of eye movement data.

Cite

Text

Zhong et al. "Video Saliency Detection via Dynamic Consistent Spatio-Temporal Attention Modelling." AAAI Conference on Artificial Intelligence, 2013. doi:10.1609/AAAI.V27I1.8642

Markdown

[Zhong et al. "Video Saliency Detection via Dynamic Consistent Spatio-Temporal Attention Modelling." AAAI Conference on Artificial Intelligence, 2013.](https://mlanthology.org/aaai/2013/zhong2013aaai-video/) doi:10.1609/AAAI.V27I1.8642

BibTeX

@inproceedings{zhong2013aaai-video,
  title     = {{Video Saliency Detection via Dynamic Consistent Spatio-Temporal Attention Modelling}},
  author    = {Zhong, Sheng-hua and Liu, Yan and Ren, Feifei and Zhang, Jinghuan and Ren, Tongwei},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2013},
  pages     = {1063-1069},
  doi       = {10.1609/AAAI.V27I1.8642},
  url       = {https://mlanthology.org/aaai/2013/zhong2013aaai-video/}
}