2D Amodal Instance Segmentation Guided by 3D Shape Prior

Abstract

Amodal instance segmentation aims to predict the complete mask of the occluded instance, including both visible and invisible regions. Existing 2D AIS methods learn and predict the complete silhouettes of target instances in 2D space. However, masks in 2D space are only some observations and samples from the 3D model in different viewpoints and thus can not represent the real complete physical shape of the instances. With the 2D masks learned, 2D amodal methods are hard to generalize to new viewpoints not included in the training dataset. To tackle these problems, we are motivated by observations that (1) a 2D amodal mask is the projection of a 3D complete model, and (2) the 3D complete model can be recovered and reconstructed from the occluded 2D object instances. This paper builds a bridge to link the 2D occluded instances with the 3D complete models by 3D reconstruction and utilizes 3D shape prior for 2D AIS. To deal with the diversity of 3D shapes, our method is pretrained on large 3D reconstruction datasets for high-quality results. And we adopt the unsupervised 3D reconstruction method to avoid relying on 3D annotations. In this approach, our method can reconstruct 3D models from occluded 2D object instances and generalize to new unseen 2D viewpoints of the 3D object. Experiments demonstrate that our method outperforms all existing 2D AIS methods. Our code will be released.

Cite

Text

Li et al. "2D Amodal Instance Segmentation Guided by 3D Shape Prior." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19818-2_10

Markdown

[Li et al. "2D Amodal Instance Segmentation Guided by 3D Shape Prior." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/li2022eccv-2d/) doi:10.1007/978-3-031-19818-2_10

BibTeX

@inproceedings{li2022eccv-2d,
  title     = {{2D Amodal Instance Segmentation Guided by 3D Shape Prior}},
  author    = {Li, Zhixuan and Ye, Weining and Jiang, Tingting and Huang, Tiejun},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2022},
  doi       = {10.1007/978-3-031-19818-2_10},
  url       = {https://mlanthology.org/eccv/2022/li2022eccv-2d/}
}