Depth and Appearance for Mobile Scene Analysis
Abstract
In this paper, we address the challenging problem of simultaneous pedestrian detection and ground-plane estimation from video while walking through a busy pedestrian zone. Our proposed system integrates robust stereo depth cues, ground-plane estimation, and appearance-based object detection in a principled fashion using a graphical model. Object-object occlusions lead to complex interactions in this model that make an exact solution computationally intractable. We therefore propose a novel iterative approach that first infers scene geometry using Belief Propagation and then resolves interactions between objects using a global optimization procedure. This approach leads to a robust solution in few iterations, while allowing object detection to benefit from geometry estimation and vice versa. We quantitatively evaluate the performance of our proposed approach on several challenging test sequences showing strolls through busy shopping streets. Comparisons to various baseline systems show that it outperforms both a system using no scene geometry and one just relying on Structure-from-Motion without dense stereo.
Cite
Text
Ess et al. "Depth and Appearance for Mobile Scene Analysis." IEEE/CVF International Conference on Computer Vision, 2007. doi:10.1109/ICCV.2007.4409092Markdown
[Ess et al. "Depth and Appearance for Mobile Scene Analysis." IEEE/CVF International Conference on Computer Vision, 2007.](https://mlanthology.org/iccv/2007/ess2007iccv-depth/) doi:10.1109/ICCV.2007.4409092BibTeX
@inproceedings{ess2007iccv-depth,
title = {{Depth and Appearance for Mobile Scene Analysis}},
author = {Ess, Andreas and Leibe, Bastian and Van Gool, Luc},
booktitle = {IEEE/CVF International Conference on Computer Vision},
year = {2007},
pages = {1-8},
doi = {10.1109/ICCV.2007.4409092},
url = {https://mlanthology.org/iccv/2007/ess2007iccv-depth/}
}