Automatic Video Summarization by Graph Modeling

Abstract

We propose a unified approach for summarization based on the analysis of video structures and video highlights. Our approach emphasizes both the content balance and perceptual quality of a summary. Normalized cut algorithm is employed to globally and optimally partition a video into clusters. A motion attention model based on human perception is employed to compute the perceptual quality of shots and clusters. The clusters, together with the computed attention values, form a temporal graph similar to Markov chain that inherently describes the evolution and perceptual importance of video clusters. In our application, the flow of a temporal graph is utilized to group similar clusters into scenes, while the attention values are used as guidelines to select appropriate subshots in scenes for summarization.

Cite

Text

Ngo et al. "Automatic Video Summarization by Graph Modeling." IEEE/CVF International Conference on Computer Vision, 2003. doi:10.1109/ICCV.2003.1238320

Markdown

[Ngo et al. "Automatic Video Summarization by Graph Modeling." IEEE/CVF International Conference on Computer Vision, 2003.](https://mlanthology.org/iccv/2003/ngo2003iccv-automatic/) doi:10.1109/ICCV.2003.1238320

BibTeX

@inproceedings{ngo2003iccv-automatic,
  title     = {{Automatic Video Summarization by Graph Modeling}},
  author    = {Ngo, Chong-Wah and Ma, Yufei and Zhang, HongJiang},
  booktitle = {IEEE/CVF International Conference on Computer Vision},
  year      = {2003},
  pages     = {104-109},
  doi       = {10.1109/ICCV.2003.1238320},
  url       = {https://mlanthology.org/iccv/2003/ngo2003iccv-automatic/}
}