Unsupervised Salient Instance Detection

Abstract

The significant amount of manual efforts in annotating pixel-level labels has triggered the advancement of unsupervised saliency learning. However without supervision signals state-of-the-art methods can only infer region-level saliency. In this paper we propose to explore the unsupervised salient instance detection (USID) problem for a more fine-grained visual understanding. Our key observation is that self-supervised transformer features may exhibit local similarities as well as different levels of contrast to other regions which provide informative cues to identify salient instances. Hence we propose SCoCo a novel network that models saliency coherence and contrast for USID. SCoCo includes two novel modules: (1) a global background adaptation (GBA) module with a scene-level contrastive loss to extract salient regions from the scene by searching the adaptive "saliency threshold" in the self-supervised transformer features and (2) a locality-aware similarity (LAS) module with an instance-level contrastive loss to group salient regions into instances by modeling the in-region saliency coherence and cross-region saliency contrasts. Extensive experiments show that SCoCo outperforms state-of-the-art weakly-supervised SID methods and carefully designed unsupervised baselines and has comparable performances to fully-supervised SID methods.

Cite

Text

Tian et al. "Unsupervised Salient Instance Detection." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.00261

Markdown

[Tian et al. "Unsupervised Salient Instance Detection." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/tian2024cvpr-unsupervised/) doi:10.1109/CVPR52733.2024.00261

BibTeX

@inproceedings{tian2024cvpr-unsupervised,
  title     = {{Unsupervised Salient Instance Detection}},
  author    = {Tian, Xin and Xu, Ke and Lau, Rynson},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {2702-2712},
  doi       = {10.1109/CVPR52733.2024.00261},
  url       = {https://mlanthology.org/cvpr/2024/tian2024cvpr-unsupervised/}
}