ConGeo: Robust Cross-View Geo-Localization Across Ground View Variations
Abstract
∗ Equal contribution Corresponding author ([email protected]) Cross-view geo-localization aims at localizing a ground-level query image by matching it to its corresponding geo-referenced aerial view. In real-world scenarios, the task requires accommodating diverse ground images captured by users with varying orientations and reduced field of views (FoVs). However, existing learning pipelines are orientation-specific or FoV-specific, demanding separate model training for different ground view variations. Such models heavily depend on the North-aligned spatial correspondence and predefined FoVs in the training data, compromising their robustness across different settings. To tackle this challenge, we propose ConGeo, a single- and cross-view Contrastive method for Geo-localization: it enhances robustness and consistency in feature representations to improve a model’s invariance to orientation and its resilience to FoV variations, by enforcing proximity between ground view variations of the same location. As a generic learning objective for cross-view geo-localization, when integrated into state-of-the-art pipelines, ConGeo significantly boosts the performance of three base models on four geo-localization benchmarks for diverse ground view variations and outperforms competing methods that train separate models for each ground view variation.
Cite
Text
Mi et al. "ConGeo: Robust Cross-View Geo-Localization Across Ground View Variations." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72630-9_13Markdown
[Mi et al. "ConGeo: Robust Cross-View Geo-Localization Across Ground View Variations." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/mi2024eccv-congeo/) doi:10.1007/978-3-031-72630-9_13BibTeX
@inproceedings{mi2024eccv-congeo,
title = {{ConGeo: Robust Cross-View Geo-Localization Across Ground View Variations}},
author = {Mi, Li and Xu, Chang and Navarro, Javiera Castillo and Montariol, Syrielle and Yang, Wen and Bosselut, Antoine and Tuia, Devis},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2024},
doi = {10.1007/978-3-031-72630-9_13},
url = {https://mlanthology.org/eccv/2024/mi2024eccv-congeo/}
}