Point Cloud Pre-Training with Natural 3D Structures

Yamada, Ryosuke; Kataoka, Hirokatsu; Chiba, Naoya; Domae, Yukiyasu; Ogata, Tetsuya

doi:10.1109/CVPR52688.2022.02060

Point Cloud Pre-Training with Natural 3D Structures

Ryosuke Yamada, Hirokatsu Kataoka, Naoya Chiba, Yukiyasu Domae, Tetsuya Ogata

CVPR 2022 pp. 21283-21293

doi:10.1109/CVPR52688.2022.02060 /cvpr/2022/yamada2022cvpr-point/

Abstract

The construction of 3D point cloud datasets requires a great deal of human effort. Therefore, constructing a largescale 3D point clouds dataset is difficult. In order to remedy this issue, we propose a newly developed point cloud fractal database (PC-FractalDB), which is a novel family of formula-driven supervised learning inspired by fractal geometry encountered in natural 3D structures. Our research is based on the hypothesis that we could learn representations from more real-world 3D patterns than conventional 3D datasets by learning fractal geometry. We show how the PC-FractalDB facilitates solving several recent dataset-related problems in 3D scene understanding, such as 3D model collection and labor-intensive annotation. The experimental section shows how we achieved the performance rate of up to 61.9% and 59.0% for the ScanNetV2 and SUN RGB-D datasets, respectively, over the current highest scores obtained with the PointContrast, contrastive scene contexts (CSC), and RandomRooms. Moreover, the PC-FractalDB pre-trained model is especially effective in training with limited data. For example, in 10% of training data on ScanNetV2, the PC-FractalDB pre-trained VoteNet performs at 38.3%, which is +14.8% higher accuracy than CSC. Of particular note, we found that the proposed method achieves the highest results for 3D object detection pre-training in limited point cloud data.

PDF CVPR Semantic Scholar

Cite

Text

Yamada et al. "Point Cloud Pre-Training with Natural 3D Structures." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.02060

Markdown

[Yamada et al. "Point Cloud Pre-Training with Natural 3D Structures." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/yamada2022cvpr-point/) doi:10.1109/CVPR52688.2022.02060

BibTeX

@inproceedings{yamada2022cvpr-point,
  title     = {{Point Cloud Pre-Training with Natural 3D Structures}},
  author    = {Yamada, Ryosuke and Kataoka, Hirokatsu and Chiba, Naoya and Domae, Yukiyasu and Ogata, Tetsuya},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2022},
  pages     = {21283-21293},
  doi       = {10.1109/CVPR52688.2022.02060},
  url       = {https://mlanthology.org/cvpr/2022/yamada2022cvpr-point/}
}