How Far Are We from Solving Pedestrian Detection?

Abstract

Encouraged by the recent progress in pedestrian detection, we investigate the gap between current state-of-the-art methods and the "perfect single frame detector". We enable our analysis by creating a human baseline for pedestrian detection (over the Caltech dataset), and by manually clustering the recurrent errors of a top detector. Our results characterise both localisation and background-versus-foreground errors. To address localisation errors we study the impact of training annotation noise on the detector performance, and show that we can improve even with a small portion of sanitised training data. To address background/foreground discrimination, we study convnets for pedestrian detection, and discuss which factors affect their performance. Other than our in-depth analysis, we report top performance on the Caltech dataset, and provide a new sanitised set of training and test annotations.

Cite

Text

Zhang et al. "How Far Are We from Solving Pedestrian Detection?." Conference on Computer Vision and Pattern Recognition, 2016. doi:10.1109/CVPR.2016.141

Markdown

[Zhang et al. "How Far Are We from Solving Pedestrian Detection?." Conference on Computer Vision and Pattern Recognition, 2016.](https://mlanthology.org/cvpr/2016/zhang2016cvpr-far/) doi:10.1109/CVPR.2016.141

BibTeX

@inproceedings{zhang2016cvpr-far,
  title     = {{How Far Are We from Solving Pedestrian Detection?}},
  author    = {Zhang, Shanshan and Benenson, Rodrigo and Omran, Mohamed and Hosang, Jan and Schiele, Bernt},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2016},
  doi       = {10.1109/CVPR.2016.141},
  url       = {https://mlanthology.org/cvpr/2016/zhang2016cvpr-far/}
}