Fused DNN: A Deep Neural Network Fusion Approach to Fast and Robust Pedestrian Detection
Abstract
We propose a deep neural network fusion architecture for fast and robust pedestrian detection. The proposed network fusion architecture allows for parallel processing of multiple networks for speed. A single shot deep convolutional network is trained as a object detector to generate all possible pedestrian candidates of different sizes and occlusions. This network outputs a large variety of pedestrian candidates to cover the majority of ground-truth pedestrians while also introducing a large number of false positives. Next, multiple deep neural networks are used in parallel for further refinement of these pedestrian candidates. We introduce a soft-rejection based network fusion method to fuse the soft metrics from all networks together to generate the final confidence scores. Our method performs better than existing state-of-the-arts, especially when detecting small-size and occluded pedestrians. Furthermore, we propose a method for integrating pixel-wise semantic segmentation network into the network fusion architecture as a reinforcement to the pedestrian detector. The approach outperforms state-of-the-art methods on most protocols on Caltech Pedestrian dataset, with significant boosts on several protocols. It is also faster than all other methods.
Cite
Text
Du et al. "Fused DNN: A Deep Neural Network Fusion Approach to Fast and Robust Pedestrian Detection." IEEE/CVF Winter Conference on Applications of Computer Vision, 2017. doi:10.1109/WACV.2017.111Markdown
[Du et al. "Fused DNN: A Deep Neural Network Fusion Approach to Fast and Robust Pedestrian Detection." IEEE/CVF Winter Conference on Applications of Computer Vision, 2017.](https://mlanthology.org/wacv/2017/du2017wacv-fused/) doi:10.1109/WACV.2017.111BibTeX
@inproceedings{du2017wacv-fused,
title = {{Fused DNN: A Deep Neural Network Fusion Approach to Fast and Robust Pedestrian Detection}},
author = {Du, Xianzhi and El-Khamy, Mostafa and Lee, Jungwon and Davis, Larry S.},
booktitle = {IEEE/CVF Winter Conference on Applications of Computer Vision},
year = {2017},
pages = {953-961},
doi = {10.1109/WACV.2017.111},
url = {https://mlanthology.org/wacv/2017/du2017wacv-fused/}
}