Characterizing Scattered Occlusions for Effective Dense-Mode Crowd Counting

Abstract

We propose a novel deep learning approach for effective dense crowd counting by characterizing scattered occlusions, named CSONet. CSONet recognizes the implications of event-induced, scene-embedded, and multitudinous obstacles such as umbrellas and picket signs to achieve an accurate crowd analysis result. CSONet is the first deep learning model for characterizing scattered occlusions of effective dense-mode crowd counting to the best of our knowledge. We have collected and annotated two new scattered occlusion object datasets, which contain crowd images occluded with umbrellas (csoumbrellas dataset) and picket signs (cso-pickets dataset). We have designed and implemented a new crowd overfit reduction network by adding both spatial pyramid pooling and dilated convolution layers over modified VGG16 for capturing high-level features of extended receptive fields. CSONet was trained on the two new scattered occlusion datasets and the ShanghaiTech A and B datasets. We also have built an algorithm that merges scattered object maps and density heatmaps of visible humans to generate a more accurate crowd density heatmap output. Through extensive evaluations, we demonstrate that the accuracy of CSONet with scattered occlusion images outperforms over the state-of-art existing crowd counting approaches by 30% to 100% in both mean absolute error and mean square error.

Cite

Text

Almalki et al. "Characterizing Scattered Occlusions for Effective Dense-Mode Crowd Counting." IEEE/CVF International Conference on Computer Vision Workshops, 2021. doi:10.1109/ICCVW54120.2021.00428

Markdown

[Almalki et al. "Characterizing Scattered Occlusions for Effective Dense-Mode Crowd Counting." IEEE/CVF International Conference on Computer Vision Workshops, 2021.](https://mlanthology.org/iccvw/2021/almalki2021iccvw-characterizing/) doi:10.1109/ICCVW54120.2021.00428

BibTeX

@inproceedings{almalki2021iccvw-characterizing,
  title     = {{Characterizing Scattered Occlusions for Effective Dense-Mode Crowd Counting}},
  author    = {Almalki, Khalid J. and Choi, Baek-Young and Chen, Yu and Song, Sejun},
  booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
  year      = {2021},
  pages     = {3833-3842},
  doi       = {10.1109/ICCVW54120.2021.00428},
  url       = {https://mlanthology.org/iccvw/2021/almalki2021iccvw-characterizing/}
}