Less Is More: Learning Highlight Detection from Video Duration

Xiong, Bo; Kalantidis, Yannis; Ghadiyaram, Deepti; Grauman, Kristen

doi:10.1109/CVPR.2019.00135

Less Is More: Learning Highlight Detection from Video Duration

Bo Xiong, Yannis Kalantidis, Deepti Ghadiyaram, Kristen Grauman

CVPR 2019

doi:10.1109/CVPR.2019.00135 /cvpr/2019/xiong2019cvpr-less/

Abstract

Highlight detection has the potential to significantly ease video browsing, but existing methods often suffer from expensive supervision requirements, where human viewers must manually identify highlights in training videos. We propose a scalable unsupervised solution that exploits video duration as an implicit supervision signal. Our key insight is that video segments from shorter user-generated videos are more likely to be highlights than those from longer videos, since users tend to be more selective about the content when capturing shorter videos. Leveraging this insight, we introduce a novel ranking framework that prefers segments from shorter videos, while properly accounting for the inherent noise in the (unlabeled) training data. We use it to train a highlight detector with 10M hashtagged Instagram videos. In experiments on two challenging public video highlight detection benchmarks, our method substantially improves the state-of-the-art for unsupervised highlight detection.

PDF CVPR Semantic Scholar

Cite

Text

Xiong et al. "Less Is More: Learning Highlight Detection from Video Duration." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. doi:10.1109/CVPR.2019.00135

Markdown

[Xiong et al. "Less Is More: Learning Highlight Detection from Video Duration." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.](https://mlanthology.org/cvpr/2019/xiong2019cvpr-less/) doi:10.1109/CVPR.2019.00135

BibTeX

@inproceedings{xiong2019cvpr-less,
  title     = {{Less Is More: Learning Highlight Detection from Video Duration}},
  author    = {Xiong, Bo and Kalantidis, Yannis and Ghadiyaram, Deepti and Grauman, Kristen},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2019},
  doi       = {10.1109/CVPR.2019.00135},
  url       = {https://mlanthology.org/cvpr/2019/xiong2019cvpr-less/}
}