Transform Coding for Fast Approximate Nearest Neighbor Search in High Dimensions
Abstract
We examine the problem of large scale nearest neighbor search in high dimensional spaces and propose a new approach based on the close relationship between nearest neighbor search and that of signal representation and quantization. Our contribution is a very simple and efficient quantization technique using transform coding and product quantization. We demonstrate its effectiveness in several settings, including large-scale retrieval, nearest neighbor classification, feature matching, and similarity search based on the bag-of-words representation. Through experiments on standard data sets we show it is competitive with state-of-the-art methods, with greater speed, simplicity, and generality. The resulting compact representation can be the basis for more elaborate hierarchical search structures for sub-linear approximate search. However, we demonstrate that optimized linear search using the quantized representation is extremely fast and trivially parallelizable on modern computer architectures, with further acceleration possible by way of GPU implementation.
Cite
Text
Brandt. "Transform Coding for Fast Approximate Nearest Neighbor Search in High Dimensions." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2010. doi:10.1109/CVPR.2010.5539852Markdown
[Brandt. "Transform Coding for Fast Approximate Nearest Neighbor Search in High Dimensions." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2010.](https://mlanthology.org/cvpr/2010/brandt2010cvpr-transform/) doi:10.1109/CVPR.2010.5539852BibTeX
@inproceedings{brandt2010cvpr-transform,
title = {{Transform Coding for Fast Approximate Nearest Neighbor Search in High Dimensions}},
author = {Brandt, Jonathan},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2010},
pages = {1815-1822},
doi = {10.1109/CVPR.2010.5539852},
url = {https://mlanthology.org/cvpr/2010/brandt2010cvpr-transform/}
}