SEMBED: Semantic Embedding of Egocentric Action Videos
Abstract
We present SEMBED, an approach for embedding an egocentric object interaction video in a semantic-visual graph to estimate the probability distribution over its potential semantic labels. When object interactions are annotated using an unbounded choice of verbs, we embrace the wealth and ambiguity of these labels by capturing the semantic relationships as well as the visual similarities over motion and appearance features. We show how SEMBED can interpret a challenging dataset of 1225 freely annotated egocentric videos, outperforming SVM classification by more than 5%.
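As a rough illustration of the idea only (not the authors' implementation): the sketch below builds a graph over training videos whose edge weights mix visual similarity with the semantic similarity of their labels, then reads off a label distribution for a new video via a short random walk. The feature vectors, the sem_sim matrix (e.g. a WordNet-style verb similarity), the random-walk readout, and every parameter name below are assumptions made for this sketch; the paper's actual motion/appearance features and inference differ in detail.

# Minimal sketch of a semantic-visual graph embedding, assuming
# precomputed visual features per training video and a label-to-label
# semantic similarity matrix. All names/parameters are illustrative.
import numpy as np

def build_graph(features, labels, sem_sim, k=5, alpha=0.5):
    """Connect each video to its k visually nearest neighbours; edge
    weights mix visual similarity with the semantic similarity of the
    endpoint labels (alpha balances the two terms)."""
    n = len(features)
    f = features / np.linalg.norm(features, axis=1, keepdims=True)
    vis = f @ f.T                                  # cosine similarities
    W = np.zeros((n, n))
    for i in range(n):
        nbrs = np.argsort(-vis[i])[1:k + 1]        # skip self at index 0
        for j in nbrs:
            w = alpha * vis[i, j] + (1 - alpha) * sem_sim[labels[i], labels[j]]
            W[i, j] = max(w, 0.0)                  # keep walk weights non-negative
    return np.maximum(W, W.T)                      # symmetrise

def label_distribution(W, labels, n_labels, query_vis_sim, steps=3):
    """Embed a query by its visual similarity to every node, run a short
    random walk over the graph, then accumulate probability mass per label."""
    p = query_vis_sim / query_vis_sim.sum()        # initial distribution over nodes
    T = W / np.maximum(W.sum(axis=1, keepdims=True), 1e-12)  # row-stochastic
    for _ in range(steps):
        p = p @ T
    dist = np.zeros(n_labels)
    for node, mass in enumerate(p):
        dist[labels[node]] += mass
    return dist / dist.sum()

# Toy usage: 6 training videos, 3 verb labels, random features.
rng = np.random.default_rng(0)
feats = rng.normal(size=(6, 16))
labels = np.array([0, 0, 1, 1, 2, 2])
sem = np.array([[1.0, 0.6, 0.1],                  # hypothetical verb similarities
                [0.6, 1.0, 0.2],
                [0.1, 0.2, 1.0]])
W = build_graph(feats, labels, sem, k=2)
q = rng.random(6)                                  # query's visual similarity to each node
print(label_distribution(W, labels, 3, q))         # distribution over the 3 labels

The design point this sketch is meant to capture: rather than committing to a single verb, the query inherits a distribution over labels, so semantically related annotations (e.g. near-synonymous verbs) reinforce one another through the graph instead of competing as in a flat classifier.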
Cite
Text
Wray et al. "SEMBED: Semantic Embedding of Egocentric Action Videos." European Conference on Computer Vision Workshops, 2016. doi:10.1007/978-3-319-46604-0_38
Markdown
[Wray et al. "SEMBED: Semantic Embedding of Egocentric Action Videos." European Conference on Computer Vision Workshops, 2016.](https://mlanthology.org/eccvw/2016/wray2016eccvw-sembed/) doi:10.1007/978-3-319-46604-0_38
BibTeX
@inproceedings{wray2016eccvw-sembed,
title = {{SEMBED: Semantic Embedding of Egocentric Action Videos}},
author = {Wray, Michael and Moltisanti, Davide and Mayol-Cuevas, Walterio W. and Damen, Dima},
booktitle = {European Conference on Computer Vision Workshops},
year = {2016},
pages = {532--545},
doi = {10.1007/978-3-319-46604-0_38},
url = {https://mlanthology.org/eccvw/2016/wray2016eccvw-sembed/}
}