Correspondence Driven Adaptation for Human Profile Recognition
Abstract
Visual recognition systems for videos using statistical learning models often show degraded performance when being deployed to a real-world environment, primarily due to the fact that training data can hardly cover sufficient variations in reality. To alleviate this issue, we propose to utilize the object correspondences in successive frames as weak supervision to adapt visual recognition models, which is particularly suitable for human profile recognition. Specifically, we substantialize this new strategy on an advanced convolutional neural network (CNN) based system to estimate human gender, age, and race. We enforce the system to output consistent and stable results on face images from the same trajectories in videos by using incremental stochastic training. Our baseline system already achieves competitive performance on gender and age estimation as compared to the state-of-the-art algorithms on the FG-NET database. Further, on two new video datasets containing about 900 persons, the proposed supervision of correspondences improves the estimation accuracy by a large margin over the baseline.
Cite
Text
Yang et al. "Correspondence Driven Adaptation for Human Profile Recognition." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2011. doi:10.1109/CVPR.2011.5995481Markdown
[Yang et al. "Correspondence Driven Adaptation for Human Profile Recognition." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2011.](https://mlanthology.org/cvpr/2011/yang2011cvpr-correspondence/) doi:10.1109/CVPR.2011.5995481BibTeX
@inproceedings{yang2011cvpr-correspondence,
title = {{Correspondence Driven Adaptation for Human Profile Recognition}},
author = {Yang, Ming and Zhu, Shenghuo and Lv, Fengjun and Yu, Kai},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2011},
pages = {505-512},
doi = {10.1109/CVPR.2011.5995481},
url = {https://mlanthology.org/cvpr/2011/yang2011cvpr-correspondence/}
}