Zero-Shot Semantic Segmentation via Variational Mapping

Naoki Kato, Toshihiko Yamasaki, Kiyoharu Aizawa

ICCVW 2019 pp. 1363-1370

doi:10.1109/ICCVW.2019.00172 /iccvw/2019/kato2019iccvw-zeroshot/

Abstract

We have witnessed the explosive success of deep neural networks (DNNs). However, DNNs typically assume a large amount of training data, and this is not always available in practical scenarios. In this paper, we present zero-shot semantic segmentation, where a model that has never seen the target class during training. For this purpose, we propose variational mapping, which facilitates effective learning by mapping the class label embedding vectors from the semantic space to the visual space. Experimental results using Pascal VOC 2012 show that our proposed method can achieve a mean intersection over union (mIoU) of 42.2, and we believe that this can serve as a baseline for similar research in the future.

ICCVW Semantic Scholar

Cite

Text

Kato et al. "Zero-Shot Semantic Segmentation via Variational Mapping." IEEE/CVF International Conference on Computer Vision Workshops, 2019. doi:10.1109/ICCVW.2019.00172

Markdown

[Kato et al. "Zero-Shot Semantic Segmentation via Variational Mapping." IEEE/CVF International Conference on Computer Vision Workshops, 2019.](https://mlanthology.org/iccvw/2019/kato2019iccvw-zeroshot/) doi:10.1109/ICCVW.2019.00172

BibTeX

@inproceedings{kato2019iccvw-zeroshot,
  title     = {{Zero-Shot Semantic Segmentation via Variational Mapping}},
  author    = {Kato, Naoki and Yamasaki, Toshihiko and Aizawa, Kiyoharu},
  booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
  year      = {2019},
  pages     = {1363-1370},
  doi       = {10.1109/ICCVW.2019.00172},
  url       = {https://mlanthology.org/iccvw/2019/kato2019iccvw-zeroshot/}
}