Towards Unlocking Web Video: Automatic People Tracking and Clustering

Abstract

This paper describes a system for automatically extracting meta-information on people from videos on the Web. The system contains multiple modules which automatically track people, including both faces and bodies, and clusters the people into distinct groups. We present new technology and significantly modify existing algorithms for body-detection, shot-detection and grouping, tracking, and track-clustering within our system. The system was designed to work effectivity on Web content, and thus exhibits robust tracking and clustering behavior over a broad spectrum of professional and semi-professional video content. In order to quantify and evaluate our system we created a large ground-truth data-set of people within video. Finally, we provide actual video examples of our algorithm and find that the results are quite strong over a broad range of content.

Cite

Text

Holub et al. "Towards Unlocking Web Video: Automatic People Tracking and Clustering." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2009. doi:10.1109/CVPRW.2009.5204190

Markdown

[Holub et al. "Towards Unlocking Web Video: Automatic People Tracking and Clustering." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2009.](https://mlanthology.org/cvprw/2009/holub2009cvprw-unlocking/) doi:10.1109/CVPRW.2009.5204190

BibTeX

@inproceedings{holub2009cvprw-unlocking,
  title     = {{Towards Unlocking Web Video: Automatic People Tracking and Clustering}},
  author    = {Holub, Alex and Moreels, Pierre and Islam, Atiq and Makhanov, Andrei and Yang, Rui},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2009},
  pages     = {47-54},
  doi       = {10.1109/CVPRW.2009.5204190},
  url       = {https://mlanthology.org/cvprw/2009/holub2009cvprw-unlocking/}
}