AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-Level Retrieval
Abstract
This work investigates the problem of instance-level image retrieval re-ranking with the constraint of memory efficiency, ultimately aiming to limit memory usage to 1KB per image. Departing from the prevalent focus on performance enhancements, this work prioritizes the crucial trade-off between performance and memory requirements. The proposed model uses a transformer-based architecture designed to estimate image-to-image similarity by capturing interactions within and across images based on their local descriptors. A distinctive property of the model is the capability for asymmetric similarity estimation. Database images are represented with a smaller number of descriptors compared to query images, enabling performance improvements without increasing memory consumption. To ensure adaptability across different applications, a universal model is introduced that adjusts to a varying number of local descriptors during the testing phase. Results on standard benchmarks demonstrate the superiority of our approach over both hand-crafted and learned models. In particular, compared with current state-of-the-art methods that overlook their memory footprint, our approach not only attains superior performance but does so with a significantly reduced memory footprint. The code and pretrained models are publicly available at: https://github.com/pavelsuma/ames
Cite
Text
Suma et al. "AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-Level Retrieval." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73202-7_18Markdown
[Suma et al. "AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-Level Retrieval." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/suma2024eccv-ames/) doi:10.1007/978-3-031-73202-7_18BibTeX
@inproceedings{suma2024eccv-ames,
title = {{AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-Level Retrieval}},
author = {Suma, Pavel and Kordopatis-Zilos, Giorgos and Iscen, Ahmet and Tolias, Giorgos},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2024},
doi = {10.1007/978-3-031-73202-7_18},
url = {https://mlanthology.org/eccv/2024/suma2024eccv-ames/}
}