Segment Anything Model (SAM) Enhances Pseudo-Labels for Weakly Supervised Semantic Segmentation

Abstract

Weakly supervised semantic segmentation (WSSS) aims to bypass the need for laborious pixel-level annotation by using only image-level annotation. Most existing methods rely on Class Activation Maps (CAM) to derive pixel-level pseudo-labels and use them to train a fully supervised semantic segmentation model. Although these pseudo-labels are class-aware, indicating the coarse regions for particular classes, they are not object-aware and fail to delineate accurate object boundaries. To address this, we introduce a simple yet effective method harnessing the Segment Anything Model (SAM), a class-agnostic foundation model capable of producing fine-grained instance masks of objects, parts, and subparts. We use CAM pseudo-labels as cues to select and combine SAM masks, resulting in high-quality pseudo-labels that are both class-aware and object-aware. Our approach is highly versatile and can be easily integrated into existing WSSS methods without any modification. Despite its simplicity, our approach shows consistent gain over the state-of-the-art WSSS methods on both PASCAL VOC and MS-COCO datasets.

Cite

Text

Chen et al. "Segment Anything Model (SAM) Enhances Pseudo-Labels for Weakly Supervised Semantic Segmentation." NeurIPS 2023 Workshops: ICBINB, 2023.

Markdown

[Chen et al. "Segment Anything Model (SAM) Enhances Pseudo-Labels for Weakly Supervised Semantic Segmentation." NeurIPS 2023 Workshops: ICBINB, 2023.](https://mlanthology.org/neuripsw/2023/chen2023neuripsw-segment/)

BibTeX

@inproceedings{chen2023neuripsw-segment,
  title     = {{Segment Anything Model (SAM) Enhances Pseudo-Labels for Weakly Supervised Semantic Segmentation}},
  author    = {Chen, Tianle and Mai, Zheda and Li, Ruiwen and Chao, Wei-Lun},
  booktitle = {NeurIPS 2023 Workshops: ICBINB},
  year      = {2023},
  url       = {https://mlanthology.org/neuripsw/2023/chen2023neuripsw-segment/}
}