Vocabulary Hierarchy Optimization for Effective and Transferable Retrieval

Abstract

Scalable image retrieval systems usually involve hierarchical quantization of local image descriptors, which produces a visual vocabulary for inverted indexing of images. Although hierarchical quantization has the merit of retrieval efficiency, the resulting visual vocabulary representation usually faces two crucial problems: (1) hierarchical quantization errors and biases in the generation of "visual words"; (2) the model cannot adapt to database variance. In this paper, we describe an unsupervised optimization strategy in generating the hierarchy structure of visual vocabulary, which produces a more effective and adaptive retrieval model for large-scale search. We adopt a novel Density-based Metric Learning (DML) algorithm, which corrects word quantization bias without supervision in hierarchy optimization, based on which we present a hierarchical rejection chain for efficient online search based on the vocabulary hierarchy. We also discovered that by hierarchy optimization, efficient and effective transfer of a retrieval model across different databases is feasible. We deployed a large-scale image retrieval system using a vocabulary tree model to validate our advances. Experiments on UKBench and street-side urban scene databases demonstrated the effectiveness of our hierarchy optimization approach in comparison with state-of-the-art methods.

Cite

Text

Ji et al. "Vocabulary Hierarchy Optimization for Effective and Transferable Retrieval." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2009. doi:10.1109/CVPR.2009.5206680

Markdown

[Ji et al. "Vocabulary Hierarchy Optimization for Effective and Transferable Retrieval." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2009.](https://mlanthology.org/cvpr/2009/ji2009cvpr-vocabulary/) doi:10.1109/CVPR.2009.5206680

BibTeX

@inproceedings{ji2009cvpr-vocabulary,
  title     = {{Vocabulary Hierarchy Optimization for Effective and Transferable Retrieval}},
  author    = {Ji, Rongrong and Xie, Xing and Yao, Hongxun and Ma, Wei-Ying},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2009},
  pages     = {1161-1168},
  doi       = {10.1109/CVPR.2009.5206680},
  url       = {https://mlanthology.org/cvpr/2009/ji2009cvpr-vocabulary/}
}