Motion Capture Using Joint Skeleton Tracking and Surface Estimation

Abstract

This paper proposes a method for capturing the performance of a human or an animal from a multi-view video sequence. Given an articulated template model and silhouettes from a multi-view image sequence, our approach recovers not only the movement of the skeleton, but also the possibly non-rigid temporal deformation of the 3D surface. While large scale deformations or fast movements are captured by the skeleton pose and approximate surface skinning, true small scale deformations or non-rigid garment motion are captured by fitting the surface to the silhouette. We further propose a novel optimization scheme for skeleton-based pose estimation that exploits the skeleton's tree structure to split the optimization problem into a local one and a lower dimensional global one. We show on various sequences that our approach can capture the 3D motion of animals and humans accurately even in the case of rapid movements and wide apparel like skirts.

Cite

Text

Gall et al. "Motion Capture Using Joint Skeleton Tracking and Surface Estimation." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2009. doi:10.1109/CVPR.2009.5206755

Markdown

[Gall et al. "Motion Capture Using Joint Skeleton Tracking and Surface Estimation." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2009.](https://mlanthology.org/cvpr/2009/gall2009cvpr-motion/) doi:10.1109/CVPR.2009.5206755

BibTeX

@inproceedings{gall2009cvpr-motion,
  title     = {{Motion Capture Using Joint Skeleton Tracking and Surface Estimation}},
  author    = {Gall, Juergen and Stoll, Carsten and de Aguiar, Edilson and Theobalt, Christian and Rosenhahn, Bodo and Seidel, Hans-Peter},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2009},
  pages     = {1746-1753},
  doi       = {10.1109/CVPR.2009.5206755},
  url       = {https://mlanthology.org/cvpr/2009/gall2009cvpr-motion/}
}