Query-Focused Extractive Video Summarization

Abstract

Video data is explosively growing. As a result of the “big video data”, intelligent algorithms for automatic video summarization have (re-)emerged as a pressing need. We develop a probabilistic model, Sequential and Hierarchical Determinantal Point Process (SH-DPP), for query-focused extractive video summarization. Given a user query and a long video sequence, our algorithm returns a summary by selecting key shots from the video. The decision to include a shot in the summary depends on the shot’s relevance to the user query and importance in the context of the video, jointly. We verify our approach on two densely annotated video datasets. The query-focused video summarization is particularly useful for search engines, e.g., to display snippets of videos.

Cite

Text

Sharghi et al. "Query-Focused Extractive Video Summarization." European Conference on Computer Vision, 2016. doi:10.1007/978-3-319-46484-8_1

Markdown

[Sharghi et al. "Query-Focused Extractive Video Summarization." European Conference on Computer Vision, 2016.](https://mlanthology.org/eccv/2016/sharghi2016eccv-query/) doi:10.1007/978-3-319-46484-8_1

BibTeX

@inproceedings{sharghi2016eccv-query,
  title     = {{Query-Focused Extractive Video Summarization}},
  author    = {Sharghi, Aidean and Gong, Boqing and Shah, Mubarak},
  booktitle = {European Conference on Computer Vision},
  year      = {2016},
  pages     = {3-19},
  doi       = {10.1007/978-3-319-46484-8_1},
  url       = {https://mlanthology.org/eccv/2016/sharghi2016eccv-query/}
}