Real-Time Upper Body Detection and 3D Pose Estimation in Monoscopic Images

Abstract

This paper presents a novel solution to the difficult task of both detecting and estimating the 3D pose of humans in monoscopic images. The approach consists of two parts. Firstly the location of a human is identified by a probabalistic assembly of detected body parts. Detectors for the face, torso and hands are learnt using adaBoost. A pose likliehood is then obtained using an a priori mixture model on body configuration and possible configurations assembled from available evidence using RANSAC. Once a human has been detected, the location is used to initialise a matching algorithm which matches the silhouette and edge map of a subject with a 3D model. This is done efficiently using chamfer matching, integral images and pose estimation from the initial detection stage. We demonstrate the application of the approach to large, cluttered natural images and at near framerate operation (16fps) on lower resolution video streams.

Cite

Text

Micilotta et al. "Real-Time Upper Body Detection and 3D Pose Estimation in Monoscopic Images." European Conference on Computer Vision, 2006. doi:10.1007/11744078_11

Markdown

[Micilotta et al. "Real-Time Upper Body Detection and 3D Pose Estimation in Monoscopic Images." European Conference on Computer Vision, 2006.](https://mlanthology.org/eccv/2006/micilotta2006eccv-real/) doi:10.1007/11744078_11

BibTeX

@inproceedings{micilotta2006eccv-real,
  title     = {{Real-Time Upper Body Detection and 3D Pose Estimation in Monoscopic Images}},
  author    = {Micilotta, Antonio S. and Ong, Eng-Jon and Bowden, Richard},
  booktitle = {European Conference on Computer Vision},
  year      = {2006},
  pages     = {139-150},
  doi       = {10.1007/11744078_11},
  url       = {https://mlanthology.org/eccv/2006/micilotta2006eccv-real/}
}