Multi-Level 3D CNN for Learning Multi-Scale Spatial Features

Ghadai, Sambit; Lee, Xian Yeow; Balu, Aditya; Sarkar, Soumik; Krishnamurthy, Adarsh

doi:10.1109/CVPRW.2019.00150

Multi-Level 3D CNN for Learning Multi-Scale Spatial Features

Sambit Ghadai, Xian Yeow Lee, Aditya Balu, Soumik Sarkar, Adarsh Krishnamurthy

CVPRW 2019 pp. 1152-1156

doi:10.1109/CVPRW.2019.00150 /cvprw/2019/ghadai2019cvprw-multilevel/

Abstract

3D object recognition accuracy can be improved by learning the multi-scale spatial features from 3D spatial geometric representations of objects such as point clouds, 3D models, surfaces, and RGB-D data. Current deep learning approaches learn such features either using structured data representations (voxel grids and octrees) or from unstructured representations (graphs and point clouds). Learning features from such structured representations is limited by the restriction on resolution and tree depth while unstructured representations creates a challenge due to non-uniformity among data samples. In this paper, we propose an end-to-end multi-level learning approach on a multi-level voxel grid to overcome these drawbacks. To demonstrate the utility of the proposed multi-level learning, we use a multi-level voxel representation of 3D objects to perform object recognition. The multi-level voxel representation consists of a coarse voxel grid that contains volumetric information of the 3D object. In addition, each voxel in the coarse grid that contains a portion of the object boundary is subdivided into multiple fine-level voxel grids. The performance of our multi-level learning algorithm for object recognition is comparable to dense voxel representations while using significantly lower memory.

PDF CVPRW Semantic Scholar

Cite

Text

Ghadai et al. "Multi-Level 3D CNN for Learning Multi-Scale Spatial Features." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2019. doi:10.1109/CVPRW.2019.00150

Markdown

[Ghadai et al. "Multi-Level 3D CNN for Learning Multi-Scale Spatial Features." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2019.](https://mlanthology.org/cvprw/2019/ghadai2019cvprw-multilevel/) doi:10.1109/CVPRW.2019.00150

BibTeX

@inproceedings{ghadai2019cvprw-multilevel,
  title     = {{Multi-Level 3D CNN for Learning Multi-Scale Spatial Features}},
  author    = {Ghadai, Sambit and Lee, Xian Yeow and Balu, Aditya and Sarkar, Soumik and Krishnamurthy, Adarsh},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2019},
  pages     = {1152-1156},
  doi       = {10.1109/CVPRW.2019.00150},
  url       = {https://mlanthology.org/cvprw/2019/ghadai2019cvprw-multilevel/}
}