Primitive3D: 3D Object Dataset Synthesis from Randomly Assembled Primitives

Abstract

Numerous advancements of deep learning can be attributed to access to large-scale and well-annotated datasets. However, such a dataset is prohibitively expensive in 3D computer vision due to the substantial collection cost. To alleviate this issue, we propose a cost-effective method for automatically generating a large amount of 3D objects with annotations. In particular, we synthesize objects simply by assembling multiple random primitives. These objects are thus auto-annotated with part-based labels originating from primitives. This allows us to perform multi-task learning by combining the supervised segmentation with unsupervised reconstruction. Considering the large overhead of learning on the generated dataset, we further propose a dataset distillation strategy to remove redundant samples regarding a target dataset. We conduct extensive experiments for the downstream tasks of 3D object classification. The results indicate that our dataset, together with multi-task pretraining on its annotations, achieves the best performance compared to other commonly used datasets. Further study suggests that our strategy can improve the model performance by pretraining and fine-tuning scheme, especially for a dataset with a small scale. In addition, pretraining with the proposed dataset distillation method can save 86% of the pretraining time with negligible performance degradation. We expect that our attempt provides a new data-centric perspective for training 3D deep models.

Cite

Text

Li et al. "Primitive3D: 3D Object Dataset Synthesis from Randomly Assembled Primitives." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.01548

Markdown

[Li et al. "Primitive3D: 3D Object Dataset Synthesis from Randomly Assembled Primitives." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/li2022cvpr-primitive3d/) doi:10.1109/CVPR52688.2022.01548

BibTeX

@inproceedings{li2022cvpr-primitive3d,
  title     = {{Primitive3D: 3D Object Dataset Synthesis from Randomly Assembled Primitives}},
  author    = {Li, Xinke and Ding, Henghui and Tong, Zekun and Wu, Yuwei and Chee, Yeow Meng},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2022},
  pages     = {15947-15957},
  doi       = {10.1109/CVPR52688.2022.01548},
  url       = {https://mlanthology.org/cvpr/2022/li2022cvpr-primitive3d/}
}