SIGN: Spatial-Information Incorporated Generative Network for Generalized Zero-Shot Semantic Segmentation

Abstract

Unlike conventional zero-shot classification, zero-shot semantic segmentation predicts a class label at the pixel level instead of the image level. When solving zero-shot semantic segmentation problems, the need for pixel-level prediction with surrounding context motivates us to incorporate spatial information using positional encoding. We improve standard positional encoding by introducing the concept of Relative Positional Encoding, which integrates spatial information at the feature level and can handle arbitrary image sizes. Furthermore, while self-training is widely used in zero-shot semantic segmentation to generate pseudo-labels, we propose a new knowledge-distillation-inspired self-training strategy, namely Annealed Self-Training, which can automatically assign different importance to pseudo-labels to improve performance. We systematically study the proposed Relative Positional Encoding and Annealed Self-Training in a comprehensive experimental evaluation, and our empirical results confirm the effectiveness of our method on three benchmark datasets.
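The abstract does not spell out how Relative Positional Encoding is computed, so the following is only a minimal sketch of the general idea, not the authors' formulation. It assumes a sinusoidal encoding applied to coordinates that are first normalized by the feature-map size, so that the same relative location receives the same code regardless of resolution. The function name `relative_positional_encoding` and the `virtual_size` parameter are hypothetical and chosen for illustration.

```python
import math


def relative_positional_encoding(height, width, channels, virtual_size=32.0):
    """Illustrative sketch (an assumption, not the paper's exact method).

    Returns a height x width x channels nested list. Half of the channels
    encode the row position and half the column position, each with
    sine/cosine pairs over coordinates normalized to [0, 1].
    """
    if channels % 4 != 0:
        raise ValueError("channels must be divisible by 4")

    n_freq = channels // 4
    # Geometric frequency schedule, as in standard sinusoidal encodings.
    freqs = [1.0 / (10000.0 ** (k / n_freq)) for k in range(n_freq)]

    def encode_axis(length):
        codes = []
        for i in range(length):
            # Relative position: 0.0 at the first index, 1.0 at the last.
            rel = i / (length - 1) if length > 1 else 0.0
            # Map onto a fixed virtual grid so the code does not depend
            # on the actual feature-map size (hypothetical design choice).
            pos = rel * virtual_size
            codes.append(
                [math.sin(pos * f) for f in freqs]
                + [math.cos(pos * f) for f in freqs]
            )
        return codes

    y_enc = encode_axis(height)
    x_enc = encode_axis(width)
    # Concatenate the row code and column code at every spatial location.
    return [[y_enc[r] + x_enc[c] for c in range(width)] for r in range(height)]


small = relative_positional_encoding(8, 12, 16)
large = relative_positional_encoding(40, 60, 16)
```

Under these assumptions, corresponding relative locations (for example the top-left or bottom-right corner) receive identical codes in `small` and `large`, which is one way an encoding can remain usable across arbitrary image sizes.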

Cite

Text

Cheng et al. "SIGN: Spatial-Information Incorporated Generative Network for Generalized Zero-Shot Semantic Segmentation." International Conference on Computer Vision, 2021. doi:10.1109/ICCV48922.2021.00942

Markdown

[Cheng et al. "SIGN: Spatial-Information Incorporated Generative Network for Generalized Zero-Shot Semantic Segmentation." International Conference on Computer Vision, 2021.](https://mlanthology.org/iccv/2021/cheng2021iccv-sign/) doi:10.1109/ICCV48922.2021.00942

BibTeX

@inproceedings{cheng2021iccv-sign,
  title     = {{SIGN: Spatial-Information Incorporated Generative Network for Generalized Zero-Shot Semantic Segmentation}},
  author    = {Cheng, Jiaxin and Nandi, Soumyaroop and Natarajan, Prem and Abd-Almageed, Wael},
  booktitle = {International Conference on Computer Vision},
  year      = {2021},
  pages     = {9556--9566},
  doi       = {10.1109/ICCV48922.2021.00942},
  url       = {https://mlanthology.org/iccv/2021/cheng2021iccv-sign/}
}