Open-World Instance Segmentation: Exploiting Pseudo Ground Truth from Learned Pairwise Affinity

Abstract

Open-world instance segmentation is the task of grouping pixels into object instances without any pre-determined taxonomy. This is challenging, as state-of-the-art methods rely on explicit class semantics obtained from large labeled datasets, and out-of-domain evaluation performance drops significantly. Here we propose a novel approach for mask proposals, Generic Grouping Networks (GGNs), constructed without semantic supervision. Our approach combines a local measure of pixel affinity with instance-level mask supervision, producing a training regimen designed to make the model as generic as the data diversity allows. We introduce a method for predicting Pairwise Affinities (PA), a learned local relationship between pairs of pixels. PA generalizes very well to unseen categories. From PA we construct a large set of pseudo-ground-truth instance masks; combined with human-annotated instance masks we train GGNs and significantly outperform the SOTA on open-world instance segmentation on various benchmarks including COCO, LVIS, ADE20K, and UVO.

Cite

Text

Wang et al. "Open-World Instance Segmentation: Exploiting Pseudo Ground Truth from Learned Pairwise Affinity." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.00438

Markdown

[Wang et al. "Open-World Instance Segmentation: Exploiting Pseudo Ground Truth from Learned Pairwise Affinity." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/wang2022cvpr-openworld/) doi:10.1109/CVPR52688.2022.00438

BibTeX

@inproceedings{wang2022cvpr-openworld,
  title     = {{Open-World Instance Segmentation: Exploiting Pseudo Ground Truth from Learned Pairwise Affinity}},
  author    = {Wang, Weiyao and Feiszli, Matt and Wang, Heng and Malik, Jitendra and Tran, Du},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2022},
  pages     = {4422-4432},
  doi       = {10.1109/CVPR52688.2022.00438},
  url       = {https://mlanthology.org/cvpr/2022/wang2022cvpr-openworld/}
}