How Much Time Do You Have? Modeling Multi-Duration Saliency

Abstract

What jumps out in a single glance of an image is different than what you might notice after closer inspection. Yet conventional models of visual saliency produce predictions at an arbitrary, fixed viewing duration, offering a limited view of the rich interactions between image content and gaze location. In this paper we propose to capture gaze as a series of snapshots, by generating population-level saliency heatmaps for multiple viewing durations. We collect the CodeCharts1K dataset, which contains multiple distinct heatmaps per image corresponding to 0.5, 3, and 5 seconds of free-viewing. We develop an LSTM-based model of saliency that simultaneously trains on data from multiple viewing durations. Our Multi-Duration Saliency Excited Model (MD-SEM) achieves competitive performance on the LSUN 2017 Challenge with 57% fewer parameters than comparable architectures. It is the first model that produces heatmaps at multiple viewing durations, enabling applications where multi-duration saliency can be used to prioritize visual content to keep, transmit, and render.

Cite

Text

Fosco et al. "How Much Time Do You Have? Modeling Multi-Duration Saliency." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.00453

Markdown

[Fosco et al. "How Much Time Do You Have? Modeling Multi-Duration Saliency." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/fosco2020cvpr-much/) doi:10.1109/CVPR42600.2020.00453

BibTeX

@inproceedings{fosco2020cvpr-much,
  title     = {{How Much Time Do You Have? Modeling Multi-Duration Saliency}},
  author    = {Fosco, Camilo and Newman, Anelise and Sukhum, Pat and Bin Zhang, Yun and Zhao, Nanxuan and Oliva, Aude and Bylinskii, Zoya},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2020},
  doi       = {10.1109/CVPR42600.2020.00453},
  url       = {https://mlanthology.org/cvpr/2020/fosco2020cvpr-much/}
}