QUICKSAL: A Small and Sparse Visual Saliency Model for Efficient Inference in Resource Constrained Hardware

Abstract

Visual saliency is an important problem in the field of cognitive science and computer vision with applications such as surveillance, adaptive compressing, detecting unknown objects and scene understanding. In this paper, we propose a small and sparse neural network model for performing salient object segmentation that is suitable for use in mobile and embedded applications. Our model is built using depthwise separable convolutions and bottleneck inverted residuals which have been proven to perform very memory-efficient inference and can be easily implemented using standard functions available in all deep learning frameworks. The multiscale features extracted along with the layers with deep residuals allow our network to learn high-quality saliency maps. We present the quantitative results of our QUICKSAL model with multiple levels of model sparsity ranging from 0% to 96%, with the non-zero parameter count varying from 3.3M to 0.14M respectively - on publicly available benchmark datasets - showing that our highly constrained approach is comparable to other state-of-the-art approaches (parameter count 35M). We also present qualitative results on camouflage images and show that our model can successfully distinguish between the salient and non-salient parts even when both seem blended together.

Cite

Text

Ramanathan et al. "QUICKSAL: A Small and Sparse Visual Saliency Model for Efficient Inference in Resource Constrained Hardware." Winter Conference on Applications of Computer Vision, 2020.

Markdown

[Ramanathan et al. "QUICKSAL: A Small and Sparse Visual Saliency Model for Efficient Inference in Resource Constrained Hardware." Winter Conference on Applications of Computer Vision, 2020.](https://mlanthology.org/wacv/2020/ramanathan2020wacv-quicksal/)

BibTeX

@inproceedings{ramanathan2020wacv-quicksal,
  title     = {{QUICKSAL: A Small and Sparse Visual Saliency Model for Efficient Inference in Resource Constrained Hardware}},
  author    = {Ramanathan, Vignesh and Dwivedi, Pritesh and Katabathuni, Bharath and Chakraborty, Anirban and Thakur, Chetan Singh},
  booktitle = {Winter Conference on Applications of Computer Vision},
  year      = {2020},
  url       = {https://mlanthology.org/wacv/2020/ramanathan2020wacv-quicksal/}
}