Weakly Supervised Few-Shot Object Segmentation Using Co-Attention with Visual and Semantic Embeddings

Mennatullah Siam, Naren Doraiswamy, Boris N. Oreshkin, Hengshuai Yao, Martin Jägersand

IJCAI 2020 pp. 860-867

doi:10.24963/IJCAI.2020/120 /ijcai/2020/siam2020ijcai-weakly/

Abstract

Significant progress has been made recently in developing few-shot object segmentation methods. Learning is shown to be successful in few-shot segmentation settings, using pixel-level, scribbles and bounding box supervision. This paper takes another approach, i.e., only requiring image-level label for few-shot object segmentation. We propose a novel multi-modal interaction module for few-shot object segmentation that utilizes a co-attention mechanism using both visual and word embedding. Our model using image-level labels achieves 4.8% improvement over previously proposed image-level few-shot object segmentation. It also outperforms state-of-the-art methods that use weak bounding box supervision on PASCAL-5^i. Our results show that few-shot segmentation benefits from utilizing word embeddings, and that we are able to perform few-shot segmentation using stacked joint visual semantic processing with weak image-level labels. We further propose a novel setup, Temporal Object Segmentation for Few-shot Learning (TOSFL) for videos. TOSFL can be used on a variety of public video data such as Youtube-VOS, as demonstrated in both instance-level and category-level TOSFL experiments.

PDF IJCAI Semantic Scholar

Cite

Text

Siam et al. "Weakly Supervised Few-Shot Object Segmentation Using Co-Attention with Visual and Semantic Embeddings." International Joint Conference on Artificial Intelligence, 2020. doi:10.24963/IJCAI.2020/120

Markdown

[Siam et al. "Weakly Supervised Few-Shot Object Segmentation Using Co-Attention with Visual and Semantic Embeddings." International Joint Conference on Artificial Intelligence, 2020.](https://mlanthology.org/ijcai/2020/siam2020ijcai-weakly/) doi:10.24963/IJCAI.2020/120

BibTeX

@inproceedings{siam2020ijcai-weakly,
  title     = {{Weakly Supervised Few-Shot Object Segmentation Using Co-Attention with Visual and Semantic Embeddings}},
  author    = {Siam, Mennatullah and Doraiswamy, Naren and Oreshkin, Boris N. and Yao, Hengshuai and Jägersand, Martin},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2020},
  pages     = {860-867},
  doi       = {10.24963/IJCAI.2020/120},
  url       = {https://mlanthology.org/ijcai/2020/siam2020ijcai-weakly/}
}