Voxel or Pillar: Exploring Efficient Point Cloud Representation for 3D Object Detection
Abstract
Efficient representation of point clouds is fundamental for LiDAR-based 3D object detection. While recent grid-based detectors often encode point clouds into either voxels or pillars, the distinctions between these approaches remain underexplored. In this paper, we quantify the differences between the current encoding paradigms and highlight the limited vertical learning within. To tackle these limitations, we propose a hybrid detection framework named Voxel-Pillar Fusion (VPF), which synergistically combines the unique strengths of both voxels and pillars. To be concrete, we first develop a sparse voxel-pillar encoder that encodes point clouds into voxel and pillar features through 3D and 2D sparse convolutions respectively, and then introduce the Sparse Fusion Layer (SFL), facilitating bidirectional interaction between sparse voxel and pillar features. Our computationally efficient, fully sparse method can be seamlessly integrated into both dense and sparse detectors. Leveraging this powerful yet straightforward representation, VPF delivers competitive performance, achieving real-time inference speeds on the nuScenes and Waymo Open Dataset.
Cite
Text
Huang et al. "Voxel or Pillar: Exploring Efficient Point Cloud Representation for 3D Object Detection." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I3.28018Markdown
[Huang et al. "Voxel or Pillar: Exploring Efficient Point Cloud Representation for 3D Object Detection." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/huang2024aaai-voxel/) doi:10.1609/AAAI.V38I3.28018BibTeX
@inproceedings{huang2024aaai-voxel,
title = {{Voxel or Pillar: Exploring Efficient Point Cloud Representation for 3D Object Detection}},
author = {Huang, Yuhao and Zhou, Sanping and Zhang, Junjie and Dong, Jinpeng and Zheng, Nanning},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2024},
pages = {2426-2435},
doi = {10.1609/AAAI.V38I3.28018},
url = {https://mlanthology.org/aaai/2024/huang2024aaai-voxel/}
}