3D Tracking = Classification + Interpolation

Abstract

Hand gestures are examples of fast and complex motions. Computers fail to track these in fast video, but sleight of hand fools humans as well: what happens too quickly we just cannot see. We show a 3D tracker for these types of motions that relies on the recognition of familiar configurations in 2D images (classification), and fills the gaps in-between (interpolation). We illustrate this idea with experiments on hand motions similar to finger spelling. The penalty for a recognition failure is often small: if two configurations are confused, they are often similar to each other, and the illusion works well enough, for instance, to drive a graphics animation of the moving hand. We contribute advances in both feature design and classifier training: our image features are invariant to image scale, translation, and rotation, and we propose a classification method that combines VQPCA with discrimination trees. 1.

Cite

Text

Tomasi et al. "3D Tracking = Classification + Interpolation." IEEE/CVF International Conference on Computer Vision, 2003. doi:10.1109/ICCV.2003.1238659

Markdown

[Tomasi et al. "3D Tracking = Classification + Interpolation." IEEE/CVF International Conference on Computer Vision, 2003.](https://mlanthology.org/iccv/2003/tomasi2003iccv-d/) doi:10.1109/ICCV.2003.1238659

BibTeX

@inproceedings{tomasi2003iccv-d,
  title     = {{3D Tracking = Classification + Interpolation}},
  author    = {Tomasi, Carlo and Petrov, Slav and Sastry, Arvind},
  booktitle = {IEEE/CVF International Conference on Computer Vision},
  year      = {2003},
  pages     = {1441-1448},
  doi       = {10.1109/ICCV.2003.1238659},
  url       = {https://mlanthology.org/iccv/2003/tomasi2003iccv-d/}
}