SegRCDB: Semantic Segmentation via Formula-Driven Supervised Learning

Abstract

Pre-training is a strong strategy for enhancing visual models to efficiently train them with a limited number of labeled images. In semantic segmentation, creating annotation masks requires an intensive amount of labor and time, and therefore, a large-scale pre-training dataset with semantic labels is quite difficult to construct. Moreover, what matters in semantic segmentation pre-training has not been fully investigated. In this paper, we propose the Segmentation Radial Contour DataBase (SegRCDB), which for the first time applies formula-driven supervised learning for semantic segmentation. SegRCDB enables pre-training for semantic segmentation without real images or any manual semantic labels. SegRCDB is based on insights about what is important in pre-training for semantic segmentation and allows efficient pre-training. Pre-training with SegRCDB achieved higher mIoU than the pre-training with COCO-Stuff for fine-tuning on ADE-20k and Cityscapes with the same number of training images. SegRCDB has a high potential to contribute to semantic segmentation pre-training and investigation by enabling the creation of large datasets without manual annotation. The SegRCDB dataset will be released under a license that allows research and commercial use.

Cite

Text

Shinoda et al. "SegRCDB: Semantic Segmentation via Formula-Driven Supervised Learning." International Conference on Computer Vision, 2023. doi:10.1109/ICCV51070.2023.01835

Markdown

[Shinoda et al. "SegRCDB: Semantic Segmentation via Formula-Driven Supervised Learning." International Conference on Computer Vision, 2023.](https://mlanthology.org/iccv/2023/shinoda2023iccv-segrcdb/) doi:10.1109/ICCV51070.2023.01835

BibTeX

@inproceedings{shinoda2023iccv-segrcdb,
  title     = {{SegRCDB: Semantic Segmentation via Formula-Driven Supervised Learning}},
  author    = {Shinoda, Risa and Hayamizu, Ryo and Nakashima, Kodai and Inoue, Nakamasa and Yokota, Rio and Kataoka, Hirokatsu},
  booktitle = {International Conference on Computer Vision},
  year      = {2023},
  pages     = {20054-20063},
  doi       = {10.1109/ICCV51070.2023.01835},
  url       = {https://mlanthology.org/iccv/2023/shinoda2023iccv-segrcdb/}
}