Detection and Tracking of Multiple Humans with Extensive Pose Articulation
Abstract
We describe a method for detecting and tracking humans. Different from most of the previous work, we focus on humans with extensive pose articulations, under situations where there is typically only a single camera, multiple humans are present and the image resolution is low. In our method pose clusters are learned from an embedded silhouette manifold. A set of object detectors, each of which corresponds to one pose cluster, are trained based on a novel Object-Weighted Appearance Model. A probabilistic pose-based transition model is used to track multiple objects within a sliding window buffer, making use of the detection responses. The track segments in the sliding windows are connected sequentially into full trajectories. Experiments on a set of challenging surveillance videos are presented; these show good performance of our approach compared to standard pedestrian detectors, under difficult conditions.
Cite
Text
Zhang et al. "Detection and Tracking of Multiple Humans with Extensive Pose Articulation." IEEE/CVF International Conference on Computer Vision, 2007. doi:10.1109/ICCV.2007.4408940Markdown
[Zhang et al. "Detection and Tracking of Multiple Humans with Extensive Pose Articulation." IEEE/CVF International Conference on Computer Vision, 2007.](https://mlanthology.org/iccv/2007/zhang2007iccv-detection/) doi:10.1109/ICCV.2007.4408940BibTeX
@inproceedings{zhang2007iccv-detection,
title = {{Detection and Tracking of Multiple Humans with Extensive Pose Articulation}},
author = {Zhang, Li and Wu, Bo and Nevatia, Ramakant},
booktitle = {IEEE/CVF International Conference on Computer Vision},
year = {2007},
pages = {1-8},
doi = {10.1109/ICCV.2007.4408940},
url = {https://mlanthology.org/iccv/2007/zhang2007iccv-detection/}
}