Webly Supervised Semantic Segmentation
Abstract
We propose a weakly supervised semantic segmentation algorithm that uses image tags for supervision. We apply the tags in queries to collect three sets of web images, which encode the clean foregrounds, the common back- grounds, and realistic scenes of the classes. We introduce a novel three-stage training pipeline to progressively learn semantic segmentation models. We first train and refine a class-specific shallow neural network to obtain segmentation masks for each class. The shallow neural networks of all classes are then assembled into one deep convolutional neural network for end-to-end training and testing. Experiments show that our method notably outperforms previous state-of-the-art weakly supervised semantic segmentation approaches on the PASCAL VOC 2012 segmentation bench- mark. We further apply the class-specific shallow neural networks to object segmentation and obtain excellent results.
Cite
Text
Jin et al. "Webly Supervised Semantic Segmentation." Conference on Computer Vision and Pattern Recognition, 2017. doi:10.1109/CVPR.2017.185Markdown
[Jin et al. "Webly Supervised Semantic Segmentation." Conference on Computer Vision and Pattern Recognition, 2017.](https://mlanthology.org/cvpr/2017/jin2017cvpr-webly/) doi:10.1109/CVPR.2017.185BibTeX
@inproceedings{jin2017cvpr-webly,
title = {{Webly Supervised Semantic Segmentation}},
author = {Jin, Bin and Segovia, Maria V. Ortiz and Susstrunk, Sabine},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2017},
doi = {10.1109/CVPR.2017.185},
url = {https://mlanthology.org/cvpr/2017/jin2017cvpr-webly/}
}