Dynamic Context-Sensitive Filtering Network for Video Salient Object Detection
Abstract
The ability to capture inter-frame dynamics has been critical to the development of video salient object detection (VSOD). While many works have achieved great success in this field, a deeper insight into its dynamic nature should be developed. In this work, we aim to answer the following questions: How can a model adjust itself to dynamic variations as well as perceive fine differences in the real-world environment; How are the temporal dynamics well introduced into spatial information over time? To this end, we propose a dynamic context-sensitive filtering network (DCFNet) equipped with a dynamic context-sensitive filtering module (DCFM) and an effective bidirectional dynamic fusion strategy. The proposed DCFM sheds new light on dynamic filter generation by extracting location-related affinities between consecutive frames. Our bidirectional dynamic fusion strategy encourages the interaction of spatial and temporal information in a dynamic manner. Experimental results demonstrate that our proposed method can achieve state-of-the-art performance on most VSOD datasets while ensuring a real-time speed of 28 fps. The source code is publicly available at https://github.com/OIPLab-DUT/DCFNet.
Cite
Text
Zhang et al. "Dynamic Context-Sensitive Filtering Network for Video Salient Object Detection." International Conference on Computer Vision, 2021. doi:10.1109/ICCV48922.2021.00158Markdown
[Zhang et al. "Dynamic Context-Sensitive Filtering Network for Video Salient Object Detection." International Conference on Computer Vision, 2021.](https://mlanthology.org/iccv/2021/zhang2021iccv-dynamic/) doi:10.1109/ICCV48922.2021.00158BibTeX
@inproceedings{zhang2021iccv-dynamic,
title = {{Dynamic Context-Sensitive Filtering Network for Video Salient Object Detection}},
author = {Zhang, Miao and Liu, Jie and Wang, Yifei and Piao, Yongri and Yao, Shunyu and Ji, Wei and Li, Jingjing and Lu, Huchuan and Luo, Zhongxuan},
booktitle = {International Conference on Computer Vision},
year = {2021},
pages = {1553-1563},
doi = {10.1109/ICCV48922.2021.00158},
url = {https://mlanthology.org/iccv/2021/zhang2021iccv-dynamic/}
}