Multi-Level 3D CNN for Learning Multi-Scale Spatial Features
Abstract
3D object recognition accuracy can be improved by learning the multi-scale spatial features from 3D spatial geometric representations of objects such as point clouds, 3D models, surfaces, and RGB-D data. Current deep learning approaches learn such features either using structured data representations (voxel grids and octrees) or from unstructured representations (graphs and point clouds). Learning features from such structured representations is limited by the restriction on resolution and tree depth while unstructured representations creates a challenge due to non-uniformity among data samples. In this paper, we propose an end-to-end multi-level learning approach on a multi-level voxel grid to overcome these drawbacks. To demonstrate the utility of the proposed multi-level learning, we use a multi-level voxel representation of 3D objects to perform object recognition. The multi-level voxel representation consists of a coarse voxel grid that contains volumetric information of the 3D object. In addition, each voxel in the coarse grid that contains a portion of the object boundary is subdivided into multiple fine-level voxel grids. The performance of our multi-level learning algorithm for object recognition is comparable to dense voxel representations while using significantly lower memory.
Cite
Text
Ghadai et al. "Multi-Level 3D CNN for Learning Multi-Scale Spatial Features." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2019. doi:10.1109/CVPRW.2019.00150Markdown
[Ghadai et al. "Multi-Level 3D CNN for Learning Multi-Scale Spatial Features." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2019.](https://mlanthology.org/cvprw/2019/ghadai2019cvprw-multilevel/) doi:10.1109/CVPRW.2019.00150BibTeX
@inproceedings{ghadai2019cvprw-multilevel,
title = {{Multi-Level 3D CNN for Learning Multi-Scale Spatial Features}},
author = {Ghadai, Sambit and Lee, Xian Yeow and Balu, Aditya and Sarkar, Soumik and Krishnamurthy, Adarsh},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2019},
pages = {1152-1156},
doi = {10.1109/CVPRW.2019.00150},
url = {https://mlanthology.org/cvprw/2019/ghadai2019cvprw-multilevel/}
}