Hierarchical Dense Correlation Distillation for Few-Shot Segmentation
Abstract
Few-shot semantic segmentation (FSS) aims to form class-agnostic models segmenting unseen classes with only a handful of annotations. Previous methods limited to the semantic feature and prototype representation suffer from coarse segmentation granularity and train-set overfitting. In this work, we design Hierarchically Decoupled Matching Network (HDMNet) mining pixel-level support correlation based on the transformer architecture. The self-attention modules are used to assist in establishing hierarchical dense features, as a means to accomplish the cascade matching between query and support features. Moreover, we propose a matching module to reduce train-set overfitting and introduce correlation distillation leveraging semantic correspondence from coarse resolution to boost fine-grained segmentation. Our method performs decently in experiments. We achieve 50.0% mIoU on COCO-5i dataset one-shot setting and 56.0% on five-shot segmentation, respectively. The code is available on the project website.
Cite
Text
Peng et al. "Hierarchical Dense Correlation Distillation for Few-Shot Segmentation." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.02264Markdown
[Peng et al. "Hierarchical Dense Correlation Distillation for Few-Shot Segmentation." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/peng2023cvpr-hierarchical/) doi:10.1109/CVPR52729.2023.02264BibTeX
@inproceedings{peng2023cvpr-hierarchical,
title = {{Hierarchical Dense Correlation Distillation for Few-Shot Segmentation}},
author = {Peng, Bohao and Tian, Zhuotao and Wu, Xiaoyang and Wang, Chengyao and Liu, Shu and Su, Jingyong and Jia, Jiaya},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2023},
pages = {23641-23651},
doi = {10.1109/CVPR52729.2023.02264},
url = {https://mlanthology.org/cvpr/2023/peng2023cvpr-hierarchical/}
}