ETR: An Efficient Transformer for Re-Ranking in Visual Place Recognition
Abstract
Visual place recognition aims to estimate the geographical location of a given image, and is usually addressed by recognizing similar reference images from a database. The reference images are usually retrieved via similarity search using global descriptors, and local descriptors are then used to re-rank the initially retrieved candidates. Re-ranking with local descriptors can significantly improve the accuracy of global retrieval but comes at a high computational cost. To achieve a good trade-off between accuracy and efficiency, we propose an Efficient Transformer for Re-ranking (ETR), which uses both global and local descriptors to re-rank the top candidates in a single shot. In contrast to traditional re-ranking methods, we leverage self-attention to capture relationships between local descriptors within a single image and cross-attention to explore the similarity of image pairs. We show that the proposed model can be regarded as a general re-ranking algorithm that significantly boosts the performance of other global-only retrieval methods. Extensive experimental results show that our method outperforms state-of-the-art methods and is orders of magnitude faster in terms of computational efficiency.
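The retrieve-then-re-rank pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the ETR model: the transformer-based pairwise scoring (self-attention and cross-attention) is replaced here by a simple mutual-nearest-neighbour count over local descriptors, and all function names and toy descriptors are hypothetical.

```python
import numpy as np

def retrieve_top_k(query_global, db_global, k):
    """Global retrieval: indices of the k database images with the
    highest cosine similarity to the query's global descriptor."""
    q = query_global / np.linalg.norm(query_global)
    db = db_global / np.linalg.norm(db_global, axis=1, keepdims=True)
    sims = db @ q
    return np.argsort(-sims)[:k]

def local_match_score(query_local, cand_local):
    """Stand-in pairwise score (NOT the ETR transformer): the number of
    mutual nearest-neighbour matches between two sets of local descriptors."""
    sim = query_local @ cand_local.T   # (n_query, n_candidate) similarities
    nn_q = sim.argmax(axis=1)          # best candidate descriptor per query descriptor
    nn_c = sim.argmax(axis=0)          # best query descriptor per candidate descriptor
    mutual = nn_c[nn_q] == np.arange(len(nn_q))
    return int(mutual.sum())

def rerank(query_global, query_local, db_global, db_local, k=5):
    """Retrieve the top-k candidates globally, then reorder them by the
    local-descriptor score. Ties keep their global-retrieval order."""
    cand = retrieve_top_k(query_global, db_global, k)
    scores = np.array([local_match_score(query_local, db_local[i]) for i in cand])
    return cand[np.argsort(-scores, kind="stable")]

# Toy data (hypothetical): image 0 is closest globally, but image 1
# matches the query best at the local-descriptor level.
query_global = np.array([1.0, 0.0])
db_global = np.array([[1.0, 0.1], [1.0, 0.3], [0.0, 1.0], [-1.0, 0.0]])
query_local = np.eye(3)
db_local = [
    np.array([[1.0, 0.0, 0.0]] * 3),  # image 0: three identical descriptors
    np.eye(3),                        # image 1: same local descriptors as the query
    np.zeros((3, 3)),                 # image 2
    np.zeros((3, 3)),                 # image 3
]

reranked = rerank(query_global, query_local, db_global, db_local, k=3)
print(reranked.tolist())  # -> [1, 0, 2]
```

In this toy case, global retrieval alone would rank image 0 first, while the local re-ranking step promotes image 1. The per-candidate loop also hints at why local re-ranking is costly: the pairwise score is computed once for every retrieved candidate.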
Cite
Text
Zhang et al. "ETR: An Efficient Transformer for Re-Ranking in Visual Place Recognition." Winter Conference on Applications of Computer Vision, 2023.
Markdown
[Zhang et al. "ETR: An Efficient Transformer for Re-Ranking in Visual Place Recognition." Winter Conference on Applications of Computer Vision, 2023.](https://mlanthology.org/wacv/2023/zhang2023wacv-etr/)
BibTeX
@inproceedings{zhang2023wacv-etr,
title = {{ETR: An Efficient Transformer for Re-Ranking in Visual Place Recognition}},
author = {Zhang, Hao and Chen, Xin and Jing, Heming and Zheng, Yingbin and Wu, Yuan and Jin, Cheng},
booktitle = {Winter Conference on Applications of Computer Vision},
year = {2023},
pages = {5665--5674},
url = {https://mlanthology.org/wacv/2023/zhang2023wacv-etr/}
}