General-Purpose Deep Point Cloud Feature Extractor

Abstract

Depth sensors used in autonomous driving and gaming systems often report 3D point clouds. Because these point clouds lack a regular structure, such systems cannot directly take advantage of recent advances in convolutional neural networks, which depend on traditional filtering and pooling operations. Analogous to image-based convolutional architectures, recently introduced graph-based architectures afford similar filtering and pooling operations on arbitrary graphs. We adapt these graph-based methods to 3D point clouds to introduce a generic vector representation of 3D graphs, which we call Graph 3D (G3D). We believe we are the first to use large-scale transfer learning on 3D point cloud data, and we demonstrate the discriminative power of our latent representation of 3D point clouds on previously unseen test sets. By using our G3D network (G3DNet) as a feature extractor and pairing G3D feature vectors with a standard classifier, we achieve the best accuracy for a graph network on ModelNet10 (93.1%) and ModelNet40 (91.7%), and performance comparable to other methods on the Sydney Urban Objects dataset. This general-purpose feature extractor can be used as an off-the-shelf component in other 3D scene understanding or object tracking work.
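
To make the idea of graph-based filtering and pooling on a point cloud concrete, here is a minimal sketch in NumPy. It is not the authors' G3DNet. It assumes a k-nearest-neighbor graph, a single mean-aggregation filtering step with random (untrained) weights, and a global max pool. The function names `knn_adjacency`, `graph_filter`, and `global_max_pool` are hypothetical and chosen for illustration only.

```python
import numpy as np


def knn_adjacency(points, k=4):
    """Symmetric binary adjacency matrix of a k-nearest-neighbor graph."""
    n = points.shape[0]
    # Pairwise squared Euclidean distances, shape (n, n).
    diff = points[:, None, :] - points[None, :, :]
    dist2 = np.sum(diff ** 2, axis=-1)
    np.fill_diagonal(dist2, np.inf)  # a point is not its own neighbor
    nearest = np.argsort(dist2, axis=1)[:, :k]
    adj = np.zeros((n, n))
    rows = np.repeat(np.arange(n), k)
    adj[rows, nearest.ravel()] = 1.0
    return np.maximum(adj, adj.T)  # make the graph undirected


def graph_filter(adj, features, weights):
    """One filtering step: average over each neighborhood, linear map, ReLU."""
    adj_hat = adj + np.eye(adj.shape[0])  # include each vertex itself
    degree = adj_hat.sum(axis=1, keepdims=True)
    aggregated = (adj_hat @ features) / degree
    return np.maximum(aggregated @ weights, 0.0)


def global_max_pool(features):
    """Collapse per-vertex features into one fixed-length vector."""
    return features.max(axis=0)


rng = np.random.default_rng(0)
cloud = rng.normal(size=(32, 3))    # assumed toy cloud: 32 points in 3D
weights = rng.normal(size=(3, 8))   # assumed untrained filter weights
adj = knn_adjacency(cloud, k=4)
descriptor = global_max_pool(graph_filter(adj, cloud, weights))
print(descriptor.shape)  # (8,)
```

The resulting fixed-length `descriptor` plays the role of a feature vector that could be passed to any standard classifier. In a trained network the weights would be learned, and several filtering and pooling stages would typically be stacked.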

Cite

Text

Domínguez et al. "General-Purpose Deep Point Cloud Feature Extractor." IEEE/CVF Winter Conference on Applications of Computer Vision, 2018. doi:10.1109/WACV.2018.00218

Markdown

[Domínguez et al. "General-Purpose Deep Point Cloud Feature Extractor." IEEE/CVF Winter Conference on Applications of Computer Vision, 2018.](https://mlanthology.org/wacv/2018/dominguez2018wacv-general/) doi:10.1109/WACV.2018.00218

BibTeX

@inproceedings{dominguez2018wacv-general,
  title     = {{General-Purpose Deep Point Cloud Feature Extractor}},
  author    = {Domínguez, Miguel and Dhamdhere, Rohan and Petkar, Atir and Jain, Saloni and Sah, Shagan and Ptucha, Raymond W.},
  booktitle = {IEEE/CVF Winter Conference on Applications of Computer Vision},
  year      = {2018},
  pages     = {1972-1981},
  doi       = {10.1109/WACV.2018.00218},
  url       = {https://mlanthology.org/wacv/2018/dominguez2018wacv-general/}
}