Switchable Deep Network for Pedestrian Detection
Abstract
In this paper, we propose a Switchable Deep Network (SDN) for pedestrian detection. The SDN automatically learns hierarchical features, salience maps, and mixture representations of different body parts. Pedestrian detection faces the challenges of background clutter and large variations of pedestrian appearance due to pose and viewpoint changes and other factors. One of our key contributions is to propose a Switchable Restricted Boltzmann Machine (SRBM) to explicitly model the complex mixture of visual variations at multiple levels. At the feature levels, it automatically estimates saliency maps for each test sample in order to separate background clutters from discriminative regions for pedestrian detection. At the part and body levels, it is able to infer the most appropriate template for the mixture models of each part and the whole body. We have devised a new generative algorithm to effectively pretrain the SDN and then fine-tune it with back-propagation. Our approach is evaluated on the Caltech and ETH datasets and achieves the state-of-the-art detection performance.
Cite
Text
Luo et al. "Switchable Deep Network for Pedestrian Detection." Conference on Computer Vision and Pattern Recognition, 2014. doi:10.1109/CVPR.2014.120Markdown
[Luo et al. "Switchable Deep Network for Pedestrian Detection." Conference on Computer Vision and Pattern Recognition, 2014.](https://mlanthology.org/cvpr/2014/luo2014cvpr-switchable/) doi:10.1109/CVPR.2014.120BibTeX
@inproceedings{luo2014cvpr-switchable,
title = {{Switchable Deep Network for Pedestrian Detection}},
author = {Luo, Ping and Tian, Yonglong and Wang, Xiaogang and Tang, Xiaoou},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2014},
doi = {10.1109/CVPR.2014.120},
url = {https://mlanthology.org/cvpr/2014/luo2014cvpr-switchable/}
}