How Far Are We from Solving Pedestrian Detection?

Shanshan Zhang, Rodrigo Benenson, Mohamed Omran, Jan Hosang, Bernt Schiele

CVPR 2016

doi:10.1109/CVPR.2016.141 /cvpr/2016/zhang2016cvpr-far/

Abstract

Encouraged by the recent progress in pedestrian detection, we investigate the gap between current state-of-the-art methods and the "perfect single frame detector". We enable our analysis by creating a human baseline for pedestrian detection (over the Caltech dataset), and by manually clustering the recurrent errors of a top detector. Our results characterise both localisation and background-versus-foreground errors. To address localisation errors we study the impact of training annotation noise on the detector performance, and show that we can improve even with a small portion of sanitised training data. To address background/foreground discrimination, we study convnets for pedestrian detection, and discuss which factors affect their performance. Other than our in-depth analysis, we report top performance on the Caltech dataset, and provide a new sanitised set of training and test annotations.

PDF CVPR Semantic Scholar

Cite

Text

Zhang et al. "How Far Are We from Solving Pedestrian Detection?." Conference on Computer Vision and Pattern Recognition, 2016. doi:10.1109/CVPR.2016.141

Markdown

[Zhang et al. "How Far Are We from Solving Pedestrian Detection?." Conference on Computer Vision and Pattern Recognition, 2016.](https://mlanthology.org/cvpr/2016/zhang2016cvpr-far/) doi:10.1109/CVPR.2016.141

BibTeX

@inproceedings{zhang2016cvpr-far,
  title     = {{How Far Are We from Solving Pedestrian Detection?}},
  author    = {Zhang, Shanshan and Benenson, Rodrigo and Omran, Mohamed and Hosang, Jan and Schiele, Bernt},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2016},
  doi       = {10.1109/CVPR.2016.141},
  url       = {https://mlanthology.org/cvpr/2016/zhang2016cvpr-far/}
}