Robust Multi-Resolution Pedestrian Detection in Traffic Scenes
Abstract
The serious performance decline with decreasing resolution is the major bottleneck for current pedestrian detection techniques [14, 23]. In this paper, we take pedestrian detection in different resolutions as different but related problems, and propose a Multi-Task model to jointly consider their commonness and differences. The model contains resolution aware transformations to map pedestrians in different resolutions to a common space, where a shared detector is constructed to distinguish pedestrians from background. For model learning, we present a coordinate descent procedure to learn the resolution aware transformations and deformable part model (DPM) based detector iteratively. In traffic scenes, there are many false positives located around vehicles, therefore, we further build a context model to suppress them according to the pedestrian-vehicle relationship. The context model can be learned automatically even when the vehicle annotations are not available. Our method reduces the mean miss rate to 60% for pedestrians taller than 30 pixels on the Caltech Pedestrian Benchmark, which noticeably outperforms previous state-of-the-art (71%).
Cite
Text
Yan et al. "Robust Multi-Resolution Pedestrian Detection in Traffic Scenes." Conference on Computer Vision and Pattern Recognition, 2013. doi:10.1109/CVPR.2013.390Markdown
[Yan et al. "Robust Multi-Resolution Pedestrian Detection in Traffic Scenes." Conference on Computer Vision and Pattern Recognition, 2013.](https://mlanthology.org/cvpr/2013/yan2013cvpr-robust/) doi:10.1109/CVPR.2013.390BibTeX
@inproceedings{yan2013cvpr-robust,
title = {{Robust Multi-Resolution Pedestrian Detection in Traffic Scenes}},
author = {Yan, Junjie and Zhang, Xucong and Lei, Zhen and Liao, Shengcai and Li, Stan Z.},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2013},
doi = {10.1109/CVPR.2013.390},
url = {https://mlanthology.org/cvpr/2013/yan2013cvpr-robust/}
}