GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection
Abstract
Integrating LiDAR and camera information into Bird’s-Eye-View (BEV) representation has emerged as a crucial aspect of 3D object detection in autonomous driving. However, existing methods are susceptible to the inaccurate calibration relationship between LiDAR and the camera sensor. Such inaccuracies result in errors in depth estimation for the camera branch, ultimately causing misalignment between LiDAR and camera BEV features. In this work, we propose a robust fusion framework called GraphBEV. Addressing errors caused by inaccurate point cloud projection, we introduce a LocalAlign module that employs neighbor-aware depth features via Graph matching. Additionally, we propose a GlobalAlign module to rectify the misalignment between LiDAR and camera BEV features. Our GraphBEV framework achieves state-of-the-art performance, with an mAP of 70.1%, surpassing BEVFusion by 1.6% on the nuScenes validation set. Importantly, our GraphBEV outperforms BEVFusion by 8.3% under conditions with misalignment noise.
Cite
Text
Song et al. "GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73347-5_20Markdown
[Song et al. "GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/song2024eccv-graphbev/) doi:10.1007/978-3-031-73347-5_20BibTeX
@inproceedings{song2024eccv-graphbev,
title = {{GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection}},
author = {Song, Ziying and Yang, Lei and Xu, Shaoqing and Liu, Lin and Xu, Dongyang and Jia, Caiyan and Jia, Feiyang and Wang, Li},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2024},
doi = {10.1007/978-3-031-73347-5_20},
url = {https://mlanthology.org/eccv/2024/song2024eccv-graphbev/}
}