Efficient Extraction of Human Motion Volumes by Tracking
Abstract
We present an automatic and efficient method to extract spatio-temporal human volumes from video, which combines top-down model-based and bottom-up appearance-based approaches. From the top-down perspective, our algorithm applies shape priors probabilistically to candidate image regions obtained by pedestrian detection, and provides accurate estimates of the human body areas which serve as important constraints for bottom-up processing. Temporal propagation of the identified region is performed with bottom-up cues in an efficient level-set framework, which takes advantage of the sparse top-down information that is available. Our formulation also optimizes the extracted human volume across frames through belief propagation and provides temporally coherent human regions. We demonstrate the ability of our method to extract human body regions efficiently and automatically from a large, challenging dataset collected from YouTube.
Cite
Text
Niebles et al. "Efficient Extraction of Human Motion Volumes by Tracking." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2010. doi:10.1109/CVPR.2010.5540152Markdown
[Niebles et al. "Efficient Extraction of Human Motion Volumes by Tracking." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2010.](https://mlanthology.org/cvpr/2010/niebles2010cvpr-efficient/) doi:10.1109/CVPR.2010.5540152BibTeX
@inproceedings{niebles2010cvpr-efficient,
title = {{Efficient Extraction of Human Motion Volumes by Tracking}},
author = {Niebles, Juan Carlos and Han, Bohyung and Fei-Fei, Li},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2010},
pages = {655-662},
doi = {10.1109/CVPR.2010.5540152},
url = {https://mlanthology.org/cvpr/2010/niebles2010cvpr-efficient/}
}