Taking a Deeper Look at Pedestrians

Abstract

The only goal of the abstract is to the answer the question: why should I read this paper? In this paper we study the use of convolutional neural networks (convnets) for the task of pedestrian detection. Despite their recent diverse successes, convnets historically underperform compared to other pedestrian detectors. We deliberately omit explicitly modelling the problem into the network (e.g. parts or occlusion modelling) and show that we can reach competitive performance without bells and whistles. In a wide range of experiments we analyse small and big convnets, their architectural choices, parameters, and the influence of different training data, including pre-training on surrogate tasks. We present the best convnet detectors on the Caltech and KITTI dataset. On Caltech our convnets reach top performance both for the Caltech1x and Caltech10x training setup. Using additional data at training time our strongest convnet model is competitive to detectors that use instead additional data at test time.

Cite

Text

Hosang et al. "Taking a Deeper Look at Pedestrians." Conference on Computer Vision and Pattern Recognition, 2015. doi:10.1109/CVPR.2015.7299034

Markdown

[Hosang et al. "Taking a Deeper Look at Pedestrians." Conference on Computer Vision and Pattern Recognition, 2015.](https://mlanthology.org/cvpr/2015/hosang2015cvpr-taking/) doi:10.1109/CVPR.2015.7299034

BibTeX

@inproceedings{hosang2015cvpr-taking,
  title     = {{Taking a Deeper Look at Pedestrians}},
  author    = {Hosang, Jan and Omran, Mohamed and Benenson, Rodrigo and Schiele, Bernt},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2015},
  doi       = {10.1109/CVPR.2015.7299034},
  url       = {https://mlanthology.org/cvpr/2015/hosang2015cvpr-taking/}
}