SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation
Abstract
This paper aims at achieving fine-grained building attribute segmentation in a cross-view scenario i.e. using satellite and street-view image pairs. The main challenge lies in overcoming the significant perspective differences between street views and satellite views. In this work we introduce SG-BEV a novel approach for satellite-guided BEV fusion for cross-view semantic segmentation. To overcome the limitations of existing cross-view projection methods in capturing the complete building facade features we innovatively incorporate Bird's Eye View (BEV) method to establish a spatially explicit mapping of street-view features. Moreover we fully leverage the advantages of multiple perspectives by introducing a novel satellite-guided reprojection module optimizing the uneven feature distribution issues associated with traditional BEV methods. Our method demonstrates significant improvements on four cross-view datasets collected from multiple cities including New York San Francisco and Boston. On average across these datasets our method achieves an increase in mIOU by 10.13% and 5.21% compared with the state-of-the-art satellite-based and cross-view methods. The code and datasets of this work will be released at https://github.com/sysu-liweijia-lab/SG-BEV.
Cite
Text
Ye et al. "SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.02621Markdown
[Ye et al. "SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/ye2024cvpr-sgbev/) doi:10.1109/CVPR52733.2024.02621BibTeX
@inproceedings{ye2024cvpr-sgbev,
title = {{SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation}},
author = {Ye, Junyan and Luo, Qiyan and Yu, Jinhua and Zhong, Huaping and Zheng, Zhimeng and He, Conghui and Li, Weijia},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2024},
pages = {27748-27757},
doi = {10.1109/CVPR52733.2024.02621},
url = {https://mlanthology.org/cvpr/2024/ye2024cvpr-sgbev/}
}