Unsupervised Video Understanding by Reconciliation of Posture Similarities

Abstract

Understanding human activity and being able to explain it in detail surpasses mere action classification by far in both complexity and value. The challenge is thus to describe an activity on the basis of its most fundamental constituents, the individual postures and their distinctive transitions. Supervised learning of such a fine-grained representation based on elementary poses is very tedious and does not scale. Therefore, we propose a completely unsupervised deep learning procedure based solely on video sequences, which starts from scratch without requiring pre-trained networks, predefined body models, or keypoints. A combinatorial sequence matching algorithm proposes relations between frames from subsets of the training data, while a CNN is reconciling the transitivity conflicts of the different subsets to learn a single concerted pose embedding despite changes in appearance across sequences. Without any manual annotation, the model learns a structured representation of postures and their temporal development. The model not only enables retrieval of similar postures but also temporal super-resolution. Additionally, based on a recurrent formulation, next frames can be synthesized.

Cite

Text

Milbich et al. "Unsupervised Video Understanding by Reconciliation of Posture Similarities." International Conference on Computer Vision, 2017. doi:10.1109/ICCV.2017.471

Markdown

[Milbich et al. "Unsupervised Video Understanding by Reconciliation of Posture Similarities." International Conference on Computer Vision, 2017.](https://mlanthology.org/iccv/2017/milbich2017iccv-unsupervised/) doi:10.1109/ICCV.2017.471

BibTeX

@inproceedings{milbich2017iccv-unsupervised,
  title     = {{Unsupervised Video Understanding by Reconciliation of Posture Similarities}},
  author    = {Milbich, Timo and Bautista, Miguel and Sutter, Ekaterina and Ommer, Bjorn},
  booktitle = {International Conference on Computer Vision},
  year      = {2017},
  doi       = {10.1109/ICCV.2017.471},
  url       = {https://mlanthology.org/iccv/2017/milbich2017iccv-unsupervised/}
}