Learning Coarse-to-Fine Sparselets for Efficient Object Detection and Scene Classification

Cheng, Gong; Han, Junwei; Guo, Lei; Liu, Tianming

doi:10.1109/CVPR.2015.7298721

Learning Coarse-to-Fine Sparselets for Efficient Object Detection and Scene Classification

Gong Cheng, Junwei Han, Lei Guo, Tianming Liu

CVPR 2015

doi:10.1109/CVPR.2015.7298721 /cvpr/2015/cheng2015cvpr-learning/

Abstract

Part model-based methods have been successfully applied to object detection and scene classification and have achieved state-of-the-art results. More recently the "sparselets" work [1-3] were introduced to serve as a universal set of shared basis learned from a large number of part detectors, resulting in notable speedup. Inspired by this framework, in this paper, we propose a novel scheme to train more effective sparselets with a coarse-to-fine framework. Specifically, we first train coarse sparselets to exploit the redundancy existing among part detectors by using an unsupervised single-hidden layer auto-encoder. Then, we simultaneously train fine sparselets and activation vectors using a supervised single-hidden-layer neural network, in which sparselets training and discriminative activation vectors learning are jointly embedded into a unified framework. In order to adequately explore the discriminative information hidden in the part detectors and to achieve sparsity, we propose to optimize a new discriminative objective function by imposing L0-norm sparsity constraint on the activation vectors. By using the proposed framework, promising results for multi-class object detection and scene classification are achieved on PASCAL VOC 2007, MIT Scene-67, and UC Merced Land Use datasets, compared with the existing sparselets baseline methods.

PDF CVPR Semantic Scholar

Cite

Text

Cheng et al. "Learning Coarse-to-Fine Sparselets for Efficient Object Detection and Scene Classification." Conference on Computer Vision and Pattern Recognition, 2015. doi:10.1109/CVPR.2015.7298721

Markdown

[Cheng et al. "Learning Coarse-to-Fine Sparselets for Efficient Object Detection and Scene Classification." Conference on Computer Vision and Pattern Recognition, 2015.](https://mlanthology.org/cvpr/2015/cheng2015cvpr-learning/) doi:10.1109/CVPR.2015.7298721

BibTeX

@inproceedings{cheng2015cvpr-learning,
  title     = {{Learning Coarse-to-Fine Sparselets for Efficient Object Detection and Scene Classification}},
  author    = {Cheng, Gong and Han, Junwei and Guo, Lei and Liu, Tianming},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2015},
  doi       = {10.1109/CVPR.2015.7298721},
  url       = {https://mlanthology.org/cvpr/2015/cheng2015cvpr-learning/}
}