Learning the Predictability of the Future

Abstract

We introduce a framework for learning from unlabeled video what is predictable in the future. Instead of committing up front to features to predict, our approach learns from data which features are predictable. Based on the observation that hyperbolic geometry naturally and compactly encodes hierarchical structure, we propose a predictive model in hyperbolic space. When the model is most confident, it will predict at a concrete level of the hierarchy, but when the model is not confident, it learns to automatically select a higher level of abstraction. Experiments on two established datasets show the key role of hierarchical representations for action prediction. Although our representation is trained with unlabeled video, visualizations show that action hierarchies emerge in the representation.

Cite

Text

Suris et al. "Learning the Predictability of the Future." Conference on Computer Vision and Pattern Recognition, 2021. doi:10.1109/CVPR46437.2021.01242

Markdown

[Suris et al. "Learning the Predictability of the Future." Conference on Computer Vision and Pattern Recognition, 2021.](https://mlanthology.org/cvpr/2021/suris2021cvpr-learning/) doi:10.1109/CVPR46437.2021.01242

BibTeX

@inproceedings{suris2021cvpr-learning,
  title     = {{Learning the Predictability of the Future}},
  author    = {Suris, Didac and Liu, Ruoshi and Vondrick, Carl},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2021},
  pages     = {12607-12617},
  doi       = {10.1109/CVPR46437.2021.01242},
  url       = {https://mlanthology.org/cvpr/2021/suris2021cvpr-learning/}
}