Fast Approximations to Structured Sparse Coding and Applications to Object Classification
Abstract
We describe a method for fast approximation of sparse coding. A given input vector is passed through a binary tree. Each leaf of the tree contains a subset of dictionary elements. The coefficients corresponding to these dictionary elements are allowed to be nonzero and their values are calculated quickly by multiplication with a precomputed pseudoinverse. The tree parameters, the dictionary, and the subsets of the dictionary corresponding to each leaf are learned. In the process of describing this algorithm, we discuss the more general problem of learning the groups in group structured sparse modeling. We show that our method creates good sparse representations by using it in the object recognition framework of [1,2]. Implementing our own fast version of the SIFT descriptor the whole system runs at 20 frames per second on 321 ×481 sized images on a laptop with a quad-core cpu, while sacrificing very little accuracy on the Caltech 101, Caltech 256, and 15 scenes benchmarks.
Cite
Text
Szlam et al. "Fast Approximations to Structured Sparse Coding and Applications to Object Classification." European Conference on Computer Vision, 2012. doi:10.1007/978-3-642-33715-4_15Markdown
[Szlam et al. "Fast Approximations to Structured Sparse Coding and Applications to Object Classification." European Conference on Computer Vision, 2012.](https://mlanthology.org/eccv/2012/szlam2012eccv-fast/) doi:10.1007/978-3-642-33715-4_15BibTeX
@inproceedings{szlam2012eccv-fast,
title = {{Fast Approximations to Structured Sparse Coding and Applications to Object Classification}},
author = {Szlam, Arthur and Gregor, Karol and LeCun, Yann},
booktitle = {European Conference on Computer Vision},
year = {2012},
pages = {200-213},
doi = {10.1007/978-3-642-33715-4_15},
url = {https://mlanthology.org/eccv/2012/szlam2012eccv-fast/}
}