How Far Are We from Solving Pedestrian Detection?
Abstract
Encouraged by the recent progress in pedestrian detection, we investigate the gap between current state-of-the-art methods and the "perfect single frame detector". We enable our analysis by creating a human baseline for pedestrian detection (over the Caltech dataset), and by manually clustering the recurrent errors of a top detector. Our results characterise both localisation and background-versus-foreground errors. To address localisation errors we study the impact of training annotation noise on the detector performance, and show that we can improve even with a small portion of sanitised training data. To address background/foreground discrimination, we study convnets for pedestrian detection, and discuss which factors affect their performance. Other than our in-depth analysis, we report top performance on the Caltech dataset, and provide a new sanitised set of training and test annotations.
Cite
Text
Zhang et al. "How Far Are We from Solving Pedestrian Detection?." Conference on Computer Vision and Pattern Recognition, 2016. doi:10.1109/CVPR.2016.141Markdown
[Zhang et al. "How Far Are We from Solving Pedestrian Detection?." Conference on Computer Vision and Pattern Recognition, 2016.](https://mlanthology.org/cvpr/2016/zhang2016cvpr-far/) doi:10.1109/CVPR.2016.141BibTeX
@inproceedings{zhang2016cvpr-far,
title = {{How Far Are We from Solving Pedestrian Detection?}},
author = {Zhang, Shanshan and Benenson, Rodrigo and Omran, Mohamed and Hosang, Jan and Schiele, Bernt},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2016},
doi = {10.1109/CVPR.2016.141},
url = {https://mlanthology.org/cvpr/2016/zhang2016cvpr-far/}
}