PointFusion: Deep Sensor Fusion for 3D Bounding Box Estimation

Xu, Danfei; Anguelov, Dragomir; Jain, Ashesh

doi:10.1109/CVPR.2018.00033

PointFusion: Deep Sensor Fusion for 3D Bounding Box Estimation

Danfei Xu, Dragomir Anguelov, Ashesh Jain

CVPR 2018

doi:10.1109/CVPR.2018.00033 /cvpr/2018/xu2018cvpr-pointfusion/

Abstract

We present PointFusion, a generic 3D object detection method that leverages both image and 3D point cloud information. Unlike existing methods that either use multi-stage pipelines or hold sensor and dataset-specific assumptions, PointFusion is conceptually simple and application-agnostic. The image data and the raw point cloud data are independently processed by a CNN and a PointNet architecture, respectively. The resulting outputs are then combined by a novel fusion network, which predicts multiple 3D box hypotheses and their confidences, using the input 3D points as spatial anchors. We evaluate PointFusion on two distinctive datasets: the KITTI dataset that features driving scenes captured with a lidar-camera setup, and the SUN-RGBD dataset that captures indoor environments with RGB-D cameras. Our model is the first one that is able to perform on par or better than the state-of-the-art on these diverse datasets without any dataset-specific model tuning.

PDF CVPR Semantic Scholar

Cite

Text

Xu et al. "PointFusion: Deep Sensor Fusion for 3D Bounding Box Estimation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018. doi:10.1109/CVPR.2018.00033

Markdown

[Xu et al. "PointFusion: Deep Sensor Fusion for 3D Bounding Box Estimation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018.](https://mlanthology.org/cvpr/2018/xu2018cvpr-pointfusion/) doi:10.1109/CVPR.2018.00033

BibTeX

@inproceedings{xu2018cvpr-pointfusion,
  title     = {{PointFusion: Deep Sensor Fusion for 3D Bounding Box Estimation}},
  author    = {Xu, Danfei and Anguelov, Dragomir and Jain, Ashesh},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2018},
  doi       = {10.1109/CVPR.2018.00033},
  url       = {https://mlanthology.org/cvpr/2018/xu2018cvpr-pointfusion/}
}