Category-Specific Video Summarization

Abstract

In large video collections with clusters of typical categories, such as “birthday party” or “flash-mob”, category-specific video summarization can produce higher quality video summaries than unsupervised approaches that are blind to the video category. Given a video from a known category, our approach first efficiently performs a temporal segmentation into semantically-consistent segments, delimited not only by shot boundaries but also general change points. Then, equipped with an SVM classifier, our approach assigns importance scores to each segment. The resulting video assembles the sequence of segments with the highest scores. The obtained video summary is therefore both short and highly informative. Experimental results on videos from the multimedia event detection (MED) dataset of TRECVID’11 show that our approach produces video summaries with higher relevance than the state of the art.

Cite

Text

Potapov et al. "Category-Specific Video Summarization." European Conference on Computer Vision, 2014. doi:10.1007/978-3-319-10599-4_35

Markdown

[Potapov et al. "Category-Specific Video Summarization." European Conference on Computer Vision, 2014.](https://mlanthology.org/eccv/2014/potapov2014eccv-category/) doi:10.1007/978-3-319-10599-4_35

BibTeX

@inproceedings{potapov2014eccv-category,
  title     = {{Category-Specific Video Summarization}},
  author    = {Potapov, Danila and Douze, Matthijs and Harchaoui, Zaïd and Schmid, Cordelia},
  booktitle = {European Conference on Computer Vision},
  year      = {2014},
  pages     = {540-555},
  doi       = {10.1007/978-3-319-10599-4_35},
  url       = {https://mlanthology.org/eccv/2014/potapov2014eccv-category/}
}