Label-Efficient Few-Shot Semantic Segmentation with Unsupervised Meta-Training

Abstract

The goal of this paper is to alleviate the training cost for few-shot semantic segmentation (FSS) models. Despite that FSS in nature improves model generalization to new concepts using only a handful of test exemplars, it relies on strong supervision from a considerable amount of labeled training data for base classes. However, collecting pixel-level annotations is notoriously expensive and time-consuming, and small-scale training datasets convey low information density that limits test-time generalization. To resolve the issue, we take a pioneering step towards label-efficient training of FSS models from fully unlabeled training data, or additionally a few labeled samples to enhance the performance. This motivates an approach based on a novel unsupervised meta-training paradigm. In particular, the approach first distills pre-trained unsupervised pixel embedding into compact semantic clusters from which a massive number of pseudo meta-tasks is constructed. To mitigate the noise in the pseudo meta-tasks, we further advocate a robust Transformer-based FSS model with a novel prototype-based cross-attention design. Extensive experiments have been conducted on two standard benchmarks, i.e., PASCAL-5i and COCO-20i, and the results show that our method produces impressive performance without any annotations, and is comparable to fully supervised competitors even using only 20% of the annotations. Our code is available at: https://github.com/SSSKYue/UMTFSS.

Cite

Text

Li et al. "Label-Efficient Few-Shot Semantic Segmentation with Unsupervised Meta-Training." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I4.28094

Markdown

[Li et al. "Label-Efficient Few-Shot Semantic Segmentation with Unsupervised Meta-Training." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/li2024aaai-label/) doi:10.1609/AAAI.V38I4.28094

BibTeX

@inproceedings{li2024aaai-label,
  title     = {{Label-Efficient Few-Shot Semantic Segmentation with Unsupervised Meta-Training}},
  author    = {Li, Jianwu and Shi, Kaiyue and Xie, Guo-Sen and Liu, Xiaofeng and Zhang, Jian and Zhou, Tianfei},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {3109-3117},
  doi       = {10.1609/AAAI.V38I4.28094},
  url       = {https://mlanthology.org/aaai/2024/li2024aaai-label/}
}