BBS-Net: RGB-D Salient Object Detection with a Bifurcated Backbone Strategy Network
Abstract
Multi-level feature fusion is a fundamental topic in computer vision for detecting, segmenting, and classifying objects at various scales. When multi-level features meet multi-modal cues, the optimal fusion problem becomes a hot potato. In this paper, we make the first attempt to leverage the inherent multi-modal and multi-level nature of RGB-D salient object detection to develop a novel cascaded refinement network. In particular, we 1) propose a bifurcated backbone strategy (BBS) to split the multi-level features into teacher and student features, and 2) utilize a depth-enhanced module (DEM) to excavate informative parts of depth cues from the channel and spatial views. This fuses RGB and depth modalities in a complementary way. Our simple yet efficient architecture, dubbed Bifurcated Backbone Strategy Network (BBS-Net), is backbone independent, and outperforms 18 SOTAs on seven challenging datasets using four metrics.
Cite
Text
Fan et al. "BBS-Net: RGB-D Salient Object Detection with a Bifurcated Backbone Strategy Network." Proceedings of the European Conference on Computer Vision (ECCV), 2020. doi:10.1007/978-3-030-58610-2_17Markdown
[Fan et al. "BBS-Net: RGB-D Salient Object Detection with a Bifurcated Backbone Strategy Network." Proceedings of the European Conference on Computer Vision (ECCV), 2020.](https://mlanthology.org/eccv/2020/fan2020eccv-bbsnet/) doi:10.1007/978-3-030-58610-2_17BibTeX
@inproceedings{fan2020eccv-bbsnet,
title = {{BBS-Net: RGB-D Salient Object Detection with a Bifurcated Backbone Strategy Network}},
author = {Fan, Deng-Ping and Zhai, Yingjie and Borji, Ali and Yang, Jufeng and Shao, Ling},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2020},
doi = {10.1007/978-3-030-58610-2_17},
url = {https://mlanthology.org/eccv/2020/fan2020eccv-bbsnet/}
}