Weakly Supervised Few-Shot Object Segmentation Using Co-Attention with Visual and Semantic Embeddings
Abstract
Significant progress has been made recently in developing few-shot object segmentation methods. Learning is shown to be successful in few-shot segmentation settings, using pixel-level, scribbles and bounding box supervision. This paper takes another approach, i.e., only requiring image-level label for few-shot object segmentation. We propose a novel multi-modal interaction module for few-shot object segmentation that utilizes a co-attention mechanism using both visual and word embedding. Our model using image-level labels achieves 4.8% improvement over previously proposed image-level few-shot object segmentation. It also outperforms state-of-the-art methods that use weak bounding box supervision on PASCAL-5^i. Our results show that few-shot segmentation benefits from utilizing word embeddings, and that we are able to perform few-shot segmentation using stacked joint visual semantic processing with weak image-level labels. We further propose a novel setup, Temporal Object Segmentation for Few-shot Learning (TOSFL) for videos. TOSFL can be used on a variety of public video data such as Youtube-VOS, as demonstrated in both instance-level and category-level TOSFL experiments.
Cite
Text
Siam et al. "Weakly Supervised Few-Shot Object Segmentation Using Co-Attention with Visual and Semantic Embeddings." International Joint Conference on Artificial Intelligence, 2020. doi:10.24963/IJCAI.2020/120Markdown
[Siam et al. "Weakly Supervised Few-Shot Object Segmentation Using Co-Attention with Visual and Semantic Embeddings." International Joint Conference on Artificial Intelligence, 2020.](https://mlanthology.org/ijcai/2020/siam2020ijcai-weakly/) doi:10.24963/IJCAI.2020/120BibTeX
@inproceedings{siam2020ijcai-weakly,
title = {{Weakly Supervised Few-Shot Object Segmentation Using Co-Attention with Visual and Semantic Embeddings}},
author = {Siam, Mennatullah and Doraiswamy, Naren and Oreshkin, Boris N. and Yao, Hengshuai and Jägersand, Martin},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2020},
pages = {860-867},
doi = {10.24963/IJCAI.2020/120},
url = {https://mlanthology.org/ijcai/2020/siam2020ijcai-weakly/}
}