Ego-Topo: Environment Affordances from Egocentric Video

Nagarajan, Tushar; Li, Yanghao; Feichtenhofer, Christoph; Grauman, Kristen

doi:10.1109/CVPR42600.2020.00024

Ego-Topo: Environment Affordances from Egocentric Video

Tushar Nagarajan, Yanghao Li, Christoph Feichtenhofer, Kristen Grauman

CVPR 2020

doi:10.1109/CVPR42600.2020.00024 /cvpr/2020/nagarajan2020cvpr-egotopo/

Abstract

First-person video naturally brings the use of a physical environment to the forefront, since it shows the camera wearer interacting fluidly in a space based on his intentions. However, current methods largely separate the observed actions from the persistent space itself. We introduce a model for environment affordances that is learned directly from egocentric video. The main idea is to gain a human-centric model of a physical space (such as a kitchen) that captures (1) the primary spatial zones of interaction and (2) the likely activities they support. Our approach decomposes a space into a topological map derived from first-person activity, organizing an ego-video into a series of visits to the different zones. Further, we show how to link zones across multiple related environments (e.g., from videos of multiple kitchens) to obtain a consolidated representation of environment functionality. On EPIC-Kitchens and EGTEA+, we demonstrate our approach for learning scene affordances and anticipating future actions in long-form video.

PDF CVPR Semantic Scholar

Cite

Text

Nagarajan et al. "Ego-Topo: Environment Affordances from Egocentric Video." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.00024

Markdown

[Nagarajan et al. "Ego-Topo: Environment Affordances from Egocentric Video." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/nagarajan2020cvpr-egotopo/) doi:10.1109/CVPR42600.2020.00024

BibTeX

@inproceedings{nagarajan2020cvpr-egotopo,
  title     = {{Ego-Topo: Environment Affordances from Egocentric Video}},
  author    = {Nagarajan, Tushar and Li, Yanghao and Feichtenhofer, Christoph and Grauman, Kristen},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2020},
  doi       = {10.1109/CVPR42600.2020.00024},
  url       = {https://mlanthology.org/cvpr/2020/nagarajan2020cvpr-egotopo/}
}