Instance as Identity: A Generic Online Paradigm for Video Instance Segmentation

Abstract

Modeling temporal information for both detection and tracking in a unified framework has been proved a promising solution to video instance segmentation (VIS). However, how to effectively incorporate the temporal information into an online model remains an open problem. In this work, we propose a new online VIS paradigm named Instance As Identity (IAI), which models temporal information for both detection and tracking in an efficient way. In detail, IAI employs a novel identification module to predict identification number for tracking instances explicitly. For passing temporal information cross frame, IAI utilizes an association module which combines current features and past embeddings. Notably, IAI can be integrated with different image models. We conduct extensive experiments on three VIS benchmarks. IAI outperforms all the online competitors on YouTube-VIS-2019 (ResNet-101 41.9 mAP) and YouTube-VIS-2021 (ResNet-50 37.7 mAP). Surprisingly, on the more challenging OVIS, IAI achieves SOTA performance (20.3 mAP). Code is available at https://github.com/zfonemore/IAI keywords: Video Instance Segmentation

Cite

Text

Zhu et al. "Instance as Identity: A Generic Online Paradigm for Video Instance Segmentation." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19818-2_30

Markdown

[Zhu et al. "Instance as Identity: A Generic Online Paradigm for Video Instance Segmentation." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/zhu2022eccv-instance/) doi:10.1007/978-3-031-19818-2_30

BibTeX

@inproceedings{zhu2022eccv-instance,
  title     = {{Instance as Identity: A Generic Online Paradigm for Video Instance Segmentation}},
  author    = {Zhu, Feng and Yang, Zongxin and Yu, Xin and Yang, Yi and Wei, Yunchao},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2022},
  doi       = {10.1007/978-3-031-19818-2_30},
  url       = {https://mlanthology.org/eccv/2022/zhu2022eccv-instance/}
}