Video Summarization Using Singular Value Decomposition
Abstract
The authors propose a novel technique for video summarization based on singular value decomposition (SVD). For the input video sequence, we create a feature-frame matrix A, and perform the SVD on it. From this SVD, we are able, to not only derive the refined feature space to better cluster visually similar frames, but also define a metric to measure the amount of visual content contained in each frame cluster using its degree of visual changes. Then, in the refined feature space, we find the most static frame cluster, define it as the content unit, and use the context value computed from it as the threshold to cluster the rest of the frames. Based on this clustering result, either the optimal set of keyframes, or a summarized motion video with the user specified time length can be generated to support different user requirements for video browsing and content overview. Our approach ensures that the summarized video representation contains little redundancy, and gives equal attention to the same amount of contents.
Cite
Text
Gong and Liu. "Video Summarization Using Singular Value Decomposition." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2000. doi:10.1109/CVPR.2000.854772Markdown
[Gong and Liu. "Video Summarization Using Singular Value Decomposition." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2000.](https://mlanthology.org/cvpr/2000/gong2000cvpr-video/) doi:10.1109/CVPR.2000.854772BibTeX
@inproceedings{gong2000cvpr-video,
title = {{Video Summarization Using Singular Value Decomposition}},
author = {Gong, Yihong and Liu, Xin},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2000},
pages = {2174-2180},
doi = {10.1109/CVPR.2000.854772},
url = {https://mlanthology.org/cvpr/2000/gong2000cvpr-video/}
}