CVM-Net: Cross-View Matching Network for Image-Based Ground-to-Aerial Geo-Localization
Abstract
The problem of localization on a geo-referenced aerial/satellite map given a query ground view image remains challenging due to the drastic change in viewpoint that causes traditional image descriptors based matching to fail. We leverage on the recent success of deep learning to propose the CVM-Net for the cross-view image-based ground-to-aerial geo-localization task. Specifically, our network is based on the Siamese architecture to do metric learning for the matching task. We first use the fully convolutional layers to extract local image features, which are then encoded into global image descriptors using the powerful NetVLAD. As part of the training procedure, we also introduce a simple yet effective weighted soft margin ranking loss function that not only speeds up the training convergence but also improves the final matching accuracy. Experimental results show that our proposed network significantly outperforms the state-of-the-art approaches on two existing benchmarking datasets.
Cite
Text
Hu et al. "CVM-Net: Cross-View Matching Network for Image-Based Ground-to-Aerial Geo-Localization." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018. doi:10.1109/CVPR.2018.00758Markdown
[Hu et al. "CVM-Net: Cross-View Matching Network for Image-Based Ground-to-Aerial Geo-Localization." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018.](https://mlanthology.org/cvpr/2018/hu2018cvpr-cvmnet/) doi:10.1109/CVPR.2018.00758BibTeX
@inproceedings{hu2018cvpr-cvmnet,
title = {{CVM-Net: Cross-View Matching Network for Image-Based Ground-to-Aerial Geo-Localization}},
author = {Hu, Sixing and Feng, Mengdan and Nguyen, Rang M. H. and Lee, Gim Hee},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2018},
doi = {10.1109/CVPR.2018.00758},
url = {https://mlanthology.org/cvpr/2018/hu2018cvpr-cvmnet/}
}