Boosting Low-Data Instance Segmentation by Unsupervised Pre-Training with Saliency Prompt

Abstract

Recently, inspired by DETR variants, query-based end-to-end instance segmentation (QEIS) methods have outperformed CNN-based models on large-scale datasets. Yet they would lose efficacy when only a small amount of training data is available since it's hard for the crucial queries/kernels to learn localization and shape priors. To this end, this work offers a novel unsupervised pre-training solution for low-data regimes. Inspired by the recent success of the Prompting technique, we introduce a new pre-training method that boosts QEIS models by giving Saliency Prompt for queries/kernels. Our method contains three parts: 1) Saliency Masks Proposal is responsible for generating pseudo masks from unlabeled images based on the saliency mechanism. 2) Prompt-Kernel Matching transfers pseudo masks into prompts and injects the corresponding localization and shape priors to the best-matched kernels. 3) Kernel Supervision is applied to supply supervision at the kernel level for robust learning. From a practical perspective, our pre-training method helps QEIS models achieve a similar convergence speed and comparable performance with CNN-based models in low-data regimes. Experimental results show that our method significantly boosts several QEIS models on three datasets.

Cite

Text

Li et al. "Boosting Low-Data Instance Segmentation by Unsupervised Pre-Training with Saliency Prompt." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.01486

Markdown

[Li et al. "Boosting Low-Data Instance Segmentation by Unsupervised Pre-Training with Saliency Prompt." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/li2023cvpr-boosting-a/) doi:10.1109/CVPR52729.2023.01486

BibTeX

@inproceedings{li2023cvpr-boosting-a,
  title     = {{Boosting Low-Data Instance Segmentation by Unsupervised Pre-Training with Saliency Prompt}},
  author    = {Li, Hao and Zhang, Dingwen and Liu, Nian and Cheng, Lechao and Dai, Yalun and Zhang, Chao and Wang, Xinggang and Han, Junwei},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2023},
  pages     = {15485-15494},
  doi       = {10.1109/CVPR52729.2023.01486},
  url       = {https://mlanthology.org/cvpr/2023/li2023cvpr-boosting-a/}
}