"Who Are You?" - Learning Person Specific Classifiers from Video
Abstract
We investigate the problem of automatically labelling faces of characters in TV or movie material with their names, using only weak supervision from automatically-aligned subtitle and script text. Our previous work (Everingham et al. [8]) demonstrated promising results on the task, but the coverage of the method (proportion of video labelled) and generalization was limited by a restriction to frontal faces and nearest neighbour classification. In this paper we build on that method, extending the coverage greatly by the detection and recognition of characters in profile views. In addition, we make the following contributions: (i) seamless tracking, integration and recognition of profile and frontal detections, and (ii) a character specific multiple kernel classifier which is able to learn the features best able to discriminate between the characters. We report results on seven episodes of the TV series "Buffy the Vampire Slayer", demonstrating significantly increased coverage and performance with respect to previous methods on this material.
Cite
Text
Sivic et al. ""Who Are You?" - Learning Person Specific Classifiers from Video." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2009. doi:10.1109/CVPR.2009.5206513Markdown
[Sivic et al. ""Who Are You?" - Learning Person Specific Classifiers from Video." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2009.](https://mlanthology.org/cvpr/2009/sivic2009cvpr-you/) doi:10.1109/CVPR.2009.5206513BibTeX
@inproceedings{sivic2009cvpr-you,
title = {{"Who Are You?" - Learning Person Specific Classifiers from Video}},
author = {Sivic, Josef and Everingham, Mark and Zisserman, Andrew},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2009},
pages = {1145-1152},
doi = {10.1109/CVPR.2009.5206513},
url = {https://mlanthology.org/cvpr/2009/sivic2009cvpr-you/}
}