TopicGeo: An Efficient Unified Framework for Geolocation

Abstract

Vision-based geolocation techniques that establish spatial correspondences between smaller query images and larger georeferenced images have gained significant attention. Existing approaches typically employ a separate "retrieve-then-match" paradigm, whereas such paradigms suffer from computational inefficiency or precision limitations. To this end, we propose TopicGeo, a unified framework for direct and precise query-to-reference image matching via three key innovations. The textual object semantics, called topics, distilled from CLIP prompt learning are embedded into the geolocation framework to eliminate intra-class and inter-class distribution discrepancies while also enhancing processing efficiency. Center-based adaptive label assignment and outlier rejection mechanisms as a joint retrieval-matching optimization strategy ensure task-coherent feature learning and precise spatial correspondences. A multi-level fine matching pipeline is introduced to refine matching from quality and quantity. Evaluations on large-scale synthetic and real-world datasets illustrate that TopicGeo achieves state-of-the-art performance in retrieval recall and matching accuracy while maintaining a balance in computational efficiency.

Cite

Text

Wang et al. "TopicGeo: An Efficient Unified Framework for Geolocation." International Conference on Computer Vision, 2025.

Markdown

[Wang et al. "TopicGeo: An Efficient Unified Framework for Geolocation." International Conference on Computer Vision, 2025.](https://mlanthology.org/iccv/2025/wang2025iccv-topicgeo/)

BibTeX

@inproceedings{wang2025iccv-topicgeo,
  title     = {{TopicGeo: An Efficient Unified Framework for Geolocation}},
  author    = {Wang, Xin and Wang, Xinlin and Gou, Shuiping},
  booktitle = {International Conference on Computer Vision},
  year      = {2025},
  pages     = {8241-8251},
  url       = {https://mlanthology.org/iccv/2025/wang2025iccv-topicgeo/}
}