FastMask: Segment Multi-Scale Object Candidates in One Shot

Abstract

Objects appear to scale differently in natural images. This fact requires methods dealing with object-centric tasks (e.g. object proposal) to have robust performance over variances in object scales. In the paper, we present a novel segment proposal framework, namely FastMask, which takes advantage of hierarchical features in deep convolutional neural networks to segment multi-scale objects in one shot. Innovatively, we adapt segment proposal network into three different functional components (body, neck and head). We further propose a weight-shared residual neck module as well as a scale-tolerant attentional head module for efficient one-shot inference. On MS COCO benchmark, the proposed FastMask outperforms all state-of-the-art segment proposal methods in average recall being 2 5 times faster. Moreover, with a slight trade-off in accuracy, FastMask can segment objects in near real time ( 13 fps) with 800*600 resolution images, demonstrating its potential in practical applications. Our implementation is available on https://github.com/voidrank/FastMask.

Cite

Text

Hu et al. "FastMask: Segment Multi-Scale Object Candidates in One Shot." Conference on Computer Vision and Pattern Recognition, 2017. doi:10.1109/CVPR.2017.245

Markdown

[Hu et al. "FastMask: Segment Multi-Scale Object Candidates in One Shot." Conference on Computer Vision and Pattern Recognition, 2017.](https://mlanthology.org/cvpr/2017/hu2017cvpr-fastmask/) doi:10.1109/CVPR.2017.245

BibTeX

@inproceedings{hu2017cvpr-fastmask,
  title     = {{FastMask: Segment Multi-Scale Object Candidates in One Shot}},
  author    = {Hu, Hexiang and Lan, Shiyi and Jiang, Yuning and Cao, Zhimin and Sha, Fei},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2017},
  doi       = {10.1109/CVPR.2017.245},
  url       = {https://mlanthology.org/cvpr/2017/hu2017cvpr-fastmask/}
}