Generalised Pose Estimation Using Depth

Abstract

Estimating the pose of an object, be it articulated, deformable or rigid, is an important task, with applications ranging from Human-Computer Interaction to environmental understanding. The idea of a general pose estimation framework, capable of being rapidly retrained to suit a variety of tasks, is appealing. In this paper a solution is proposed requiring only a set of labelled training images in order to be applied to many pose estimation tasks. This is achieved by treating pose estimation as a classification problem, with particle filtering used to provide non-discretised estimates. Depth information extracted from a calibrated stereo sequence, is used for background suppression and object scale estimation. The appearance and shape channels are then transformed to Local Binary Pattern histograms, and pose classification is performed via a randomised decision forest. To demonstrate flexibility, the approach is applied to two different situations, articulated hand pose and rigid head orientation, achieving 97% and 84% accurate estimation rates, respectively.

Cite

Text

Hadfield and Bowden. "Generalised Pose Estimation Using Depth." European Conference on Computer Vision, 2010. doi:10.1007/978-3-642-35749-7_24

Markdown

[Hadfield and Bowden. "Generalised Pose Estimation Using Depth." European Conference on Computer Vision, 2010.](https://mlanthology.org/eccv/2010/hadfield2010eccv-generalised/) doi:10.1007/978-3-642-35749-7_24

BibTeX

@inproceedings{hadfield2010eccv-generalised,
  title     = {{Generalised Pose Estimation Using Depth}},
  author    = {Hadfield, Simon and Bowden, Richard},
  booktitle = {European Conference on Computer Vision},
  year      = {2010},
  pages     = {312-325},
  doi       = {10.1007/978-3-642-35749-7_24},
  url       = {https://mlanthology.org/eccv/2010/hadfield2010eccv-generalised/}
}