Open-World Semantic Segmentation Including Class Similarity

Abstract

Interpreting camera data is key for autonomously acting systems such as autonomous vehicles. Vision systems that operate in real-world environments must be able to understand their surroundings and need the ability to deal with novel situations. This paper tackles open-world semantic segmentation i.e. the variant of interpreting image data in which objects occur that have not been seen during training. We propose a novel approach that performs accurate closed-world semantic segmentation and at the same time can identify new categories without requiring any additional training data. Our approach additionally provides a similarity measure for every newly discovered class in an image to a known category which can be useful information in downstream tasks such as planning or mapping. Through extensive experiments we show that our model achieves state-of-the-art results on classes known from training data as well as for anomaly segmentation and can distinguish between different unknown classes.

Cite

Text

Sodano et al. "Open-World Semantic Segmentation Including Class Similarity." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.00307

Markdown

[Sodano et al. "Open-World Semantic Segmentation Including Class Similarity." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/sodano2024cvpr-openworld/) doi:10.1109/CVPR52733.2024.00307

BibTeX

@inproceedings{sodano2024cvpr-openworld,
  title     = {{Open-World Semantic Segmentation Including Class Similarity}},
  author    = {Sodano, Matteo and Magistri, Federico and Nunes, Lucas and Behley, Jens and Stachniss, Cyrill},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {3184-3194},
  doi       = {10.1109/CVPR52733.2024.00307},
  url       = {https://mlanthology.org/cvpr/2024/sodano2024cvpr-openworld/}
}