Semantic Text Summarization of Long Videos

Abstract

Long videos captured by consumers are typically tied to some of the most important moments of their lives, yet ironically are often the least frequently watched. The time required to initially retrieve and watch sections can be daunting. In this work we propose novel techniques for summarizing and annotating long videos. Existing video summarization techniques focus exclusively on identifying keyframes and subshots; however, evaluating these summarized videos is a challenging task. Our work proposes methods to generate visual summaries of long videos, and in addition proposes techniques to annotate and generate textual summaries of the videos using recurrent networks. Interesting segments of a long video are extracted based on image quality as well as cinematographic and consumer preference. Key frames from the most impactful segments are converted to textual annotations using sequential encoding and decoding deep learning models. Our summarization technique is benchmarked on the VideoSet dataset, and evaluated by humans for informative and linguistic content. We believe this to be the first fully automatic method capable of simultaneous visual and textual summarization of long consumer videos.

Cite

Text

Sah et al. "Semantic Text Summarization of Long Videos." IEEE/CVF Winter Conference on Applications of Computer Vision, 2017. doi:10.1109/WACV.2017.115

Markdown

[Sah et al. "Semantic Text Summarization of Long Videos." IEEE/CVF Winter Conference on Applications of Computer Vision, 2017.](https://mlanthology.org/wacv/2017/sah2017wacv-semantic/) doi:10.1109/WACV.2017.115

BibTeX

@inproceedings{sah2017wacv-semantic,
  title     = {{Semantic Text Summarization of Long Videos}},
  author    = {Sah, Shagan and Kulhare, Sourabh and Gray, Allison and Venugopalan, Subhashini and Prud'hommeaux, Emily and Ptucha, Raymond W.},
  booktitle = {IEEE/CVF Winter Conference on Applications of Computer Vision},
  year      = {2017},
  pages     = {989--997},
  doi       = {10.1109/WACV.2017.115},
  url       = {https://mlanthology.org/wacv/2017/sah2017wacv-semantic/}
}