Incremental Few-Shot Semantic Segmentation via Multi-Level Switchable Visual Prompts
Abstract
Existing incremental few-shot semantic segmentation (IFSS) methods often learn novel classes by fine-tuning parameters from previous stages. This inevitably reduces the distinguishability of old class features, leading to catastrophic forgetting and overfitting to limited new samples. In this paper, we propose a novel prompt-based IFSS method with a visual prompt pool to store and switch multi-granular knowledge across stages, enhancing the model's ability to learn new classes. Specifically, we introduce three levels of prompts: 1) Task-persistent prompts: capturing generalizable knowledge shared across stages, such as foreground-background distributions, to ensure consistent recognition guidance; 2) Stage-specific prompts: adapting to the unique requirements of each stage by integrating its discriminative knowledge (e.g., shape difference) with common knowledge from previous stages; and 3) Region-unique prompts: encoding category-specific structures (e.g., edges) to more accurately guide the model to retain local details. In particular, we introduce a prompt switching mechanism that adaptively allocates the knowledge required for base and new classes, avoiding interference between prompts and preventing catastrophic forgetting and reducing the increasing computation. Our method achieves a new state-of-the-art performance, outperforming previous SoTA methods by 30.28% mIoU-N on VOC and 13.90% mIoU-N on COCO under 1-shot setting. Code is available at https://github.com/WanMotion/MSVP.
Cite
Text
Wan et al. "Incremental Few-Shot Semantic Segmentation via Multi-Level Switchable Visual Prompts." International Conference on Computer Vision, 2025.Markdown
[Wan et al. "Incremental Few-Shot Semantic Segmentation via Multi-Level Switchable Visual Prompts." International Conference on Computer Vision, 2025.](https://mlanthology.org/iccv/2025/wan2025iccv-incremental/)BibTeX
@inproceedings{wan2025iccv-incremental,
title = {{Incremental Few-Shot Semantic Segmentation via Multi-Level Switchable Visual Prompts}},
author = {Wan, Maoxian and Li, Kaige and Geng, Qichuan and Shi, Weimin and Zhou, Zhong},
booktitle = {International Conference on Computer Vision},
year = {2025},
pages = {24113-24122},
url = {https://mlanthology.org/iccv/2025/wan2025iccv-incremental/}
}