Spatio-Temporal Action Graph Networks

Herzig, Roei; Levi, Elad; Xu, Huijuan; Gao, Hang; Brosh, Eli; Wang, Xiaolong; Globerson, Amir; Darrell, Trevor

doi:10.1109/ICCVW.2019.00288

Spatio-Temporal Action Graph Networks

Roei Herzig, Elad Levi, Huijuan Xu, Hang Gao, Eli Brosh, Xiaolong Wang, Amir Globerson, Trevor Darrell

ICCVW 2019 pp. 2347-2356

doi:10.1109/ICCVW.2019.00288 /iccvw/2019/herzig2019iccvw-spatiotemporal/

Abstract

Events defined by the interaction of objects in a scene are often of critical importance; yet important events may have insufficient labeled examples to train a conventional deep model to generalize to future object appearance. Activity recognition models that represent object interactions explicitly have the potential to learn in a more efficient manner than those that represent scenes with global descriptors. We propose a novel inter-object graph representation for activity recognition based on a disentangled graph embedding with direct observation of edge appearance. In contrast to prior efforts, our approach uses explicit appearance for high order relations derived from object-object interaction, formed over regions that are the union of the spatial extent of the constituent objects. We employ a novel factored embedding of the graph structure, disentangling a representation hierarchy formed over spatial dimensions from that found over temporal variation. We demonstrate the effectiveness of our model on the Charades activity recognition benchmark, as well as a new dataset of driving activities focusing on multi-object interactions with near-collision events. Our model offers significantly improved performance compared to baseline approaches without object-graph representations, or with previous graph-based models.

PDF ICCVW Semantic Scholar

Cite

Text

Herzig et al. "Spatio-Temporal Action Graph Networks." IEEE/CVF International Conference on Computer Vision Workshops, 2019. doi:10.1109/ICCVW.2019.00288

Markdown

[Herzig et al. "Spatio-Temporal Action Graph Networks." IEEE/CVF International Conference on Computer Vision Workshops, 2019.](https://mlanthology.org/iccvw/2019/herzig2019iccvw-spatiotemporal/) doi:10.1109/ICCVW.2019.00288

BibTeX

@inproceedings{herzig2019iccvw-spatiotemporal,
  title     = {{Spatio-Temporal Action Graph Networks}},
  author    = {Herzig, Roei and Levi, Elad and Xu, Huijuan and Gao, Hang and Brosh, Eli and Wang, Xiaolong and Globerson, Amir and Darrell, Trevor},
  booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
  year      = {2019},
  pages     = {2347-2356},
  doi       = {10.1109/ICCVW.2019.00288},
  url       = {https://mlanthology.org/iccvw/2019/herzig2019iccvw-spatiotemporal/}
}