CityPersons: A Diverse Dataset for Pedestrian Detection

Abstract

Convnets have enabled significant progress in pedestrian detection recently, but there are still open questions regard- ing suitable architectures and training data. We revisit CNN design and point out key adaptations, enabling plain Fas- terRCNN to obtain state-of-the-art results on the Caltech dataset. To achieve further improvement from more and better data, we introduce CityPersons, a new set of person annotations on top of the Cityscapes dataset. The di- versity of CityPersons allows us for the first time to train one single CNN model that generalizes well over mul- tiple benchmarks. Moreover, with additional training with CityPersons, we obtain top results using FasterRCNN on Caltech, improving especially for more difficult cases (heavy occlusion and small scale) and providing higher loc- alization quality.

Cite

Text

Zhang et al. "CityPersons: A Diverse Dataset for Pedestrian Detection." Conference on Computer Vision and Pattern Recognition, 2017. doi:10.1109/CVPR.2017.474

Markdown

[Zhang et al. "CityPersons: A Diverse Dataset for Pedestrian Detection." Conference on Computer Vision and Pattern Recognition, 2017.](https://mlanthology.org/cvpr/2017/zhang2017cvpr-citypersons/) doi:10.1109/CVPR.2017.474

BibTeX

@inproceedings{zhang2017cvpr-citypersons,
  title     = {{CityPersons: A Diverse Dataset for Pedestrian Detection}},
  author    = {Zhang, Shanshan and Benenson, Rodrigo and Schiele, Bernt},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2017},
  doi       = {10.1109/CVPR.2017.474},
  url       = {https://mlanthology.org/cvpr/2017/zhang2017cvpr-citypersons/}
}