Specialize and Fuse: Pyramidal Output Representation for Semantic Segmentation

Abstract

We present a novel pyramidal output representation to ensure parsimony with our "specialize and fuse" process for semantic segmentation. A pyramidal "output" representation consists of coarse-to-fine levels, where each level is "specialize" in a different class distribution (e.g., more stuff than things classes at coarser levels). Two types of pyramidal outputs (i.e., unity and semantic pyramid) are "fused" into the final semantic output, where the unity pyramid indicates unity-cells (i.e., all pixels in such cell share the same semantic label). The process ensures parsimony by predicting a relatively small number of labels for unity-cells (e.g., a large cell of grass) to build the final semantic output. In addition to the "output" representation, we design a coarse-to-fine contextual module to aggregate the "features" representation from different levels. We validate the effectiveness of each key module in our method through comprehensive ablation studies. Finally, our approach achieves state-of-the-art performance on three widely-used semantic segmentation datasets---ADE20K, COCO-Stuff, and Pascal-Context.

Cite

Text

Hsiao et al. "Specialize and Fuse: Pyramidal Output Representation for Semantic Segmentation." International Conference on Computer Vision, 2021. doi:10.1109/ICCV48922.2021.00705

Markdown

[Hsiao et al. "Specialize and Fuse: Pyramidal Output Representation for Semantic Segmentation." International Conference on Computer Vision, 2021.](https://mlanthology.org/iccv/2021/hsiao2021iccv-specialize/) doi:10.1109/ICCV48922.2021.00705

BibTeX

@inproceedings{hsiao2021iccv-specialize,
  title     = {{Specialize and Fuse: Pyramidal Output Representation for Semantic Segmentation}},
  author    = {Hsiao, Chi-Wei and Sun, Cheng and Chen, Hwann-Tzong and Sun, Min},
  booktitle = {International Conference on Computer Vision},
  year      = {2021},
  pages     = {7137-7146},
  doi       = {10.1109/ICCV48922.2021.00705},
  url       = {https://mlanthology.org/iccv/2021/hsiao2021iccv-specialize/}
}