Joint Deep Learning for Pedestrian Detection
Abstract
Feature extraction, deformation handling, occlusion handling, and classi?cation are four important components in pedestrian detection. Existing methods learn or design these components either individually or sequentially. The interaction among these components is not yet well explored. This paper proposes that they should be jointly learned in order to maximize their strengths through cooperation. We formulate these four components into a joint deep learning framework and propose a new deep network architecture 1 . By establishing automatic, mutual interaction among components, the deep model achieves a 9% reduction in the average miss rate compared with the current best-performing pedestrian detection approaches on the largest Caltech benchmark dataset.
Cite
Text
Ouyang and Wang. "Joint Deep Learning for Pedestrian Detection." International Conference on Computer Vision, 2013. doi:10.1109/ICCV.2013.257Markdown
[Ouyang and Wang. "Joint Deep Learning for Pedestrian Detection." International Conference on Computer Vision, 2013.](https://mlanthology.org/iccv/2013/ouyang2013iccv-joint/) doi:10.1109/ICCV.2013.257BibTeX
@inproceedings{ouyang2013iccv-joint,
title = {{Joint Deep Learning for Pedestrian Detection}},
author = {Ouyang, Wanli and Wang, Xiaogang},
booktitle = {International Conference on Computer Vision},
year = {2013},
doi = {10.1109/ICCV.2013.257},
url = {https://mlanthology.org/iccv/2013/ouyang2013iccv-joint/}
}