Vocabulary Hierarchy Optimization for Effective and Transferable Retrieval
Abstract
Scalable image retrieval systems usually involve hierarchical quantization of local image descriptors, which produces a visual vocabulary for inverted indexing of images. Although hierarchical quantization has the merit of retrieval efficiency, the resulting visual vocabulary representation usually faces two crucial problems: (1) hierarchical quantization errors and biases in the generation of "visual words"; (2) the model cannot adapt to database variance. In this paper, we describe an unsupervised optimization strategy in generating the hierarchy structure of visual vocabulary, which produces a more effective and adaptive retrieval model for large-scale search. We adopt a novel Density-based Metric Learning (DML) algorithm, which corrects word quantization bias without supervision in hierarchy optimization, based on which we present a hierarchical rejection chain for efficient online search based on the vocabulary hierarchy. We also discovered that by hierarchy optimization, efficient and effective transfer of a retrieval model across different databases is feasible. We deployed a large-scale image retrieval system using a vocabulary tree model to validate our advances. Experiments on UKBench and street-side urban scene databases demonstrated the effectiveness of our hierarchy optimization approach in comparison with state-of-the-art methods.
Cite
Text
Ji et al. "Vocabulary Hierarchy Optimization for Effective and Transferable Retrieval." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2009. doi:10.1109/CVPR.2009.5206680Markdown
[Ji et al. "Vocabulary Hierarchy Optimization for Effective and Transferable Retrieval." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2009.](https://mlanthology.org/cvpr/2009/ji2009cvpr-vocabulary/) doi:10.1109/CVPR.2009.5206680BibTeX
@inproceedings{ji2009cvpr-vocabulary,
title = {{Vocabulary Hierarchy Optimization for Effective and Transferable Retrieval}},
author = {Ji, Rongrong and Xie, Xing and Yao, Hongxun and Ma, Wei-Ying},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2009},
pages = {1161-1168},
doi = {10.1109/CVPR.2009.5206680},
url = {https://mlanthology.org/cvpr/2009/ji2009cvpr-vocabulary/}
}