Open-Vocabulary Semantic Segmentation Using Test-Time Distillation

Abstract

Semantic segmentation is a key computer vision task that has been actively researched for decades. In recent years, supervised methods have reached unprecedented accuracy; however, obtaining pixel-level annotation is very time-consuming and expensive. In this paper, we propose a novel open-vocabulary approach to creating semantic segmentation masks, without the need for training segmentation networks or seeing any segmentation masks. At test time, our method takes as input the image-level labels of the categories present in the image. We utilize a vision-language embedding model to create a rough segmentation map for each class via model interpretability methods and refine the maps using a test-time augmentation technique. The output of this stage provides pixel-level pseudo-labels, which are utilized by single-image segmentation techniques to obtain high-quality output segmentations. Our method is shown quantitatively and qualitatively to outperform methods that use a similar amount of supervision, and to be competitive with weakly-supervised semantic-segmentation techniques.

Cite

Text

Zabari and Hoshen. "Open-Vocabulary Semantic Segmentation Using Test-Time Distillation." European Conference on Computer Vision Workshops, 2022. doi:10.1007/978-3-031-25063-7_4

Markdown

[Zabari and Hoshen. "Open-Vocabulary Semantic Segmentation Using Test-Time Distillation." European Conference on Computer Vision Workshops, 2022.](https://mlanthology.org/eccvw/2022/zabari2022eccvw-openvocabulary/) doi:10.1007/978-3-031-25063-7_4

BibTeX

@inproceedings{zabari2022eccvw-openvocabulary,
  title     = {{Open-Vocabulary Semantic Segmentation Using Test-Time Distillation}},
  author    = {Zabari, Nir and Hoshen, Yedid},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2022},
  pages     = {56-72},
  doi       = {10.1007/978-3-031-25063-7_4},
  url       = {https://mlanthology.org/eccvw/2022/zabari2022eccvw-openvocabulary/}
}