Multipath Sparse Coding Using Hierarchical Matching Pursuit

Abstract

Complex real-world signals, such as images, contain discriminative structures that differ in many aspects including scale, invariance, and data channel. While progress in deep learning shows the importance of learning features through multiple layers, it is equally important to learn features through multiple paths. We propose Multipath Hierarchical Matching Pursuit (M-HMP), a novel feature learning architecture that combines a collection of hierarchical sparse features for image classification to capture multiple aspects of discriminative structures. Our building blocks are MI-KSVD, a codebook learning algorithm that balances the reconstruction error and the mutual incoherence of the codebook, and batch orthogonal matching pursuit (OMP); we apply them recursively at varying layers and scales. The result is a highly discriminative image representation that leads to large improvements to the state-of-the-art on many standard benchmarks, e.g., Caltech-101, Caltech-256, MITScenes, Oxford-IIIT Pet and Caltech-UCSD Bird-200.
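The two quantities the abstract names, sparse coding by orthogonal matching pursuit and the mutual incoherence of a codebook, can be illustrated with a minimal NumPy sketch. This is plain greedy OMP on a single signal, not the batch variant or the MI-KSVD learning algorithm from the paper; the function names `omp` and `mutual_coherence`, the sparsity parameter `k`, and the stopping tolerance are assumptions made for illustration only.

```python
import numpy as np


def omp(D, y, k):
    """Greedy OMP sketch: approximate y using at most k columns (atoms) of D.

    Assumes the columns of D are unit-norm. Returns a sparse code x with
    D @ x approximately equal to y.
    """
    y = np.asarray(y, dtype=float)
    residual = y.copy()
    support = []                 # indices of the atoms selected so far
    coef = np.zeros(0)
    for _ in range(k):
        # Select the atom most correlated with the current residual.
        correlations = np.abs(D.T @ residual)
        if support:
            correlations[support] = 0.0  # never reselect an atom
        support.append(int(np.argmax(correlations)))
        # Re-fit all selected coefficients jointly by least squares.
        coef, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ coef
        if np.linalg.norm(residual) < 1e-10:  # illustrative tolerance
            break
    x = np.zeros(D.shape[1])
    x[support] = coef
    return x


def mutual_coherence(D):
    """Largest absolute inner product between two distinct unit-norm atoms."""
    gram = np.abs(D.T @ D)
    np.fill_diagonal(gram, 0.0)
    return float(gram.max())
```

A lower `mutual_coherence` value means the atoms are closer to orthogonal, which is the property a codebook learner would trade off against reconstruction error. For an orthonormal codebook such as `np.eye(4)`, the coherence is zero and `omp` recovers any `k`-sparse signal exactly.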

Cite

Text

Bo et al. "Multipath Sparse Coding Using Hierarchical Matching Pursuit." Conference on Computer Vision and Pattern Recognition, 2013. doi:10.1109/CVPR.2013.91

Markdown

[Bo et al. "Multipath Sparse Coding Using Hierarchical Matching Pursuit." Conference on Computer Vision and Pattern Recognition, 2013.](https://mlanthology.org/cvpr/2013/bo2013cvpr-multipath/) doi:10.1109/CVPR.2013.91

BibTeX

@inproceedings{bo2013cvpr-multipath,
  title     = {{Multipath Sparse Coding Using Hierarchical Matching Pursuit}},
  author    = {Bo, Liefeng and Ren, Xiaofeng and Fox, Dieter},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2013},
  doi       = {10.1109/CVPR.2013.91},
  url       = {https://mlanthology.org/cvpr/2013/bo2013cvpr-multipath/}
}