Nested Sparse Quantization for Efficient Feature Coding
Abstract
Many state-of-the-art methods in object recognition extract features from an image and encode them, followed by a pooling step and classification. Within this processing pipeline, often the encoding step is the bottleneck, for both computational efficiency and performance. We present a novel assignment-based encoding formulation. It allows for the fusion of assignment-based encoding and sparse coding into one formulation. We also use this to design a new, very efficient, encoding. At the heart of our formulation lies a quantization into a set of k -sparse vectors, which we denote as sparse quantization. We design the new encoding as two nested, sparse quantizations. Its efficiency stems from leveraging bit-wise representations. In a series of experiments on standard recognition benchmarks, namely Caltech 101, PASCAL VOC 07 and ImageNet, we demonstrate that our method achieves results that are competitive with the state-of-the-art, and requires orders of magnitude less time and memory. Our method is able to encode one million images using 4 CPUs in a single day, while maintaining a good performance.
Cite
Text
Boix et al. "Nested Sparse Quantization for Efficient Feature Coding." European Conference on Computer Vision, 2012. doi:10.1007/978-3-642-33709-3_53Markdown
[Boix et al. "Nested Sparse Quantization for Efficient Feature Coding." European Conference on Computer Vision, 2012.](https://mlanthology.org/eccv/2012/boix2012eccv-nested/) doi:10.1007/978-3-642-33709-3_53BibTeX
@inproceedings{boix2012eccv-nested,
title = {{Nested Sparse Quantization for Efficient Feature Coding}},
author = {Boix, Xavier and Roig, Gemma and Leistner, Christian and Van Gool, Luc},
booktitle = {European Conference on Computer Vision},
year = {2012},
pages = {744-758},
doi = {10.1007/978-3-642-33709-3_53},
url = {https://mlanthology.org/eccv/2012/boix2012eccv-nested/}
}