MCIP: Multi-Stream Network for Pedestrian Crossing Intention Prediction

Abstract

Predicting the crossing intention of pedestrian is an essential task for autonomous driving systems. Whether or not a pedestrian will cross a crosswalk is a significantly inevitable skills for safety driving. Although many datasets and models are proposed to precisely predict the intention of pedestrian, they lack the ability to integrate different types of information. Therefore, we propose a Multi-Stream Network for Pedestrian Crossing Intention Prediction (MCIP) based on our novel optimal merging method. The proposed method consists of integration modules that takes two visual and three non-visual elements as an input. We achieved state-of-the-art performance on accuracy of pedestrian crossing intention, F1-score, and AUC with both public standard pedestrian datasets, PIE and JAAD. Furthermore, we compared the performance of our MCIP with other networks quantitatively by visualizing the intention of the pedestrian. Lastly, we performed ablation studies to observe the effectiveness of our multi-stream methods.

Cite

Text

Ham et al. "MCIP: Multi-Stream Network for Pedestrian Crossing Intention Prediction." European Conference on Computer Vision Workshops, 2022. doi:10.1007/978-3-031-25056-9_42

Markdown

[Ham et al. "MCIP: Multi-Stream Network for Pedestrian Crossing Intention Prediction." European Conference on Computer Vision Workshops, 2022.](https://mlanthology.org/eccvw/2022/ham2022eccvw-mcip/) doi:10.1007/978-3-031-25056-9_42

BibTeX

@inproceedings{ham2022eccvw-mcip,
  title     = {{MCIP: Multi-Stream Network for Pedestrian Crossing Intention Prediction}},
  author    = {Ham, Je-Seok and Bae, Kangmin and Moon, Jinyoung},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2022},
  pages     = {663-679},
  doi       = {10.1007/978-3-031-25056-9_42},
  url       = {https://mlanthology.org/eccvw/2022/ham2022eccvw-mcip/}
}