Webly Supervised Semantic Segmentation

Bin Jin, Maria V. Ortiz Segovia, Sabine Susstrunk

CVPR 2017

doi:10.1109/CVPR.2017.185 /cvpr/2017/jin2017cvpr-webly/

Abstract

We propose a weakly supervised semantic segmentation algorithm that uses image tags for supervision. We apply the tags in queries to collect three sets of web images, which encode the clean foregrounds, the common back- grounds, and realistic scenes of the classes. We introduce a novel three-stage training pipeline to progressively learn semantic segmentation models. We first train and refine a class-specific shallow neural network to obtain segmentation masks for each class. The shallow neural networks of all classes are then assembled into one deep convolutional neural network for end-to-end training and testing. Experiments show that our method notably outperforms previous state-of-the-art weakly supervised semantic segmentation approaches on the PASCAL VOC 2012 segmentation bench- mark. We further apply the class-specific shallow neural networks to object segmentation and obtain excellent results.

PDF CVPR Semantic Scholar

Cite

Text

Jin et al. "Webly Supervised Semantic Segmentation." Conference on Computer Vision and Pattern Recognition, 2017. doi:10.1109/CVPR.2017.185

Markdown

[Jin et al. "Webly Supervised Semantic Segmentation." Conference on Computer Vision and Pattern Recognition, 2017.](https://mlanthology.org/cvpr/2017/jin2017cvpr-webly/) doi:10.1109/CVPR.2017.185

BibTeX

@inproceedings{jin2017cvpr-webly,
  title     = {{Webly Supervised Semantic Segmentation}},
  author    = {Jin, Bin and Segovia, Maria V. Ortiz and Susstrunk, Sabine},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2017},
  doi       = {10.1109/CVPR.2017.185},
  url       = {https://mlanthology.org/cvpr/2017/jin2017cvpr-webly/}
}