Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors

Abstract

The performance of existing single-view 3D reconstruction methods heavily relies on large-scale of 3D annotations. However, such annotations are tedious and expensive to collect. Semi-supervised learning serves as an alternative way to mitigate the need for manual labels, but remains unexplored in 3D reconstruction. Inspired by the recent success of self-ensembling method in semi-supervised image classification task, we first propose SSP3D, a semi-supervised framework for 3D reconstruction. In particular, we introduce an attention-guided prototype shape prior module for guiding realistic object reconstruction. we further introduce a discriminator-guided module to incentivize better shape generation, as well as a regularizer to tolerate noisy training samples. On the ShapeNet benchmark, the proposed approach outperforms previous supervised methods by clear margins margin under various labeling ratios, ( i.e., 1%, 5%, 10% and 20%). Moreover, our approach also performs well when transferring to real-world Pix3D datasets under labeling ratios of 10%. We also demonstrate our method could transfer to novel categories with few novel supervised data. Experiments on the popular ShapeNet dataset show that our method outperforms the zero-shot baseline by over 12% and the current state-of-the-art by over 7% in the few-shot setting.

Cite

Text

Xing et al. "Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19769-7_31

Markdown

[Xing et al. "Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/xing2022eccv-semisupervised/) doi:10.1007/978-3-031-19769-7_31

BibTeX

@inproceedings{xing2022eccv-semisupervised,
  title     = {{Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors}},
  author    = {Xing, Zhen and Li, Hengduo and Wu, Zuxuan and Jiang, Yu-Gang},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2022},
  doi       = {10.1007/978-3-031-19769-7_31},
  url       = {https://mlanthology.org/eccv/2022/xing2022eccv-semisupervised/}
}