Learning Discriminative Aggregation Network for Video-Based Face Recognition

Abstract

In this paper, we propose a discriminative aggregation network (DAN) for video face recognition, which aims to integrate information from video frames effectively and efficiently. Different from existing aggregation methods, our method aggregates raw video frames directly instead of the features obtained by complex processing. By combining the idea of metric learning and adversarial learning, we learn an aggregation network that produces more discriminative synthesized images compared to input frames. Our framework reduces the number of frames to be processed and greatly speed up the recognition procedure. Furthermore, low-quality frames containing misleading information are denoised during the aggregation process, making the system more robust and discriminative. Experimental results show that our framework can generate discriminative images from video clips and improve the overall recognition performance in both the speed and accuracy on three widely used datasets.

Cite

Text

Rao et al. "Learning Discriminative Aggregation Network for Video-Based Face Recognition." International Conference on Computer Vision, 2017. doi:10.1109/ICCV.2017.408

Markdown

[Rao et al. "Learning Discriminative Aggregation Network for Video-Based Face Recognition." International Conference on Computer Vision, 2017.](https://mlanthology.org/iccv/2017/rao2017iccv-learning/) doi:10.1109/ICCV.2017.408

BibTeX

@inproceedings{rao2017iccv-learning,
  title     = {{Learning Discriminative Aggregation Network for Video-Based Face Recognition}},
  author    = {Rao, Yongming and Lin, Ji and Lu, Jiwen and Zhou, Jie},
  booktitle = {International Conference on Computer Vision},
  year      = {2017},
  doi       = {10.1109/ICCV.2017.408},
  url       = {https://mlanthology.org/iccv/2017/rao2017iccv-learning/}
}