SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation

Abstract

This paper aims at achieving fine-grained building attribute segmentation in a cross-view scenario i.e. using satellite and street-view image pairs. The main challenge lies in overcoming the significant perspective differences between street views and satellite views. In this work we introduce SG-BEV a novel approach for satellite-guided BEV fusion for cross-view semantic segmentation. To overcome the limitations of existing cross-view projection methods in capturing the complete building facade features we innovatively incorporate Bird's Eye View (BEV) method to establish a spatially explicit mapping of street-view features. Moreover we fully leverage the advantages of multiple perspectives by introducing a novel satellite-guided reprojection module optimizing the uneven feature distribution issues associated with traditional BEV methods. Our method demonstrates significant improvements on four cross-view datasets collected from multiple cities including New York San Francisco and Boston. On average across these datasets our method achieves an increase in mIOU by 10.13% and 5.21% compared with the state-of-the-art satellite-based and cross-view methods. The code and datasets of this work will be released at https://github.com/sysu-liweijia-lab/SG-BEV.

Cite

Text

Ye et al. "SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.02621

Markdown

[Ye et al. "SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/ye2024cvpr-sgbev/) doi:10.1109/CVPR52733.2024.02621

BibTeX

@inproceedings{ye2024cvpr-sgbev,
  title     = {{SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation}},
  author    = {Ye, Junyan and Luo, Qiyan and Yu, Jinhua and Zhong, Huaping and Zheng, Zhimeng and He, Conghui and Li, Weijia},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {27748-27757},
  doi       = {10.1109/CVPR52733.2024.02621},
  url       = {https://mlanthology.org/cvpr/2024/ye2024cvpr-sgbev/}
}