Learning Non-Linear Reconstruction Models for Image Set Classification

Abstract

We propose a deep learning framework for image set classification with application to face recognition. An Adaptive Deep Network Template (ADNT) is defined whose parameters are initialized by performing unsupervised pre-training in a layer-wise fashion using Gaussian Restricted Boltzmann Machines (GRBMs). The pre-initialized ADNT is then separately trained for images of each class and class-specific models are learnt. Based on the minimum reconstruction error from the learnt class-specific models, a majority voting strategy is used for classification. The proposed framework is extensively evaluated for the task of image set classification based face recognition on Honda/UCSD, CMU Mobo, YouTube Celebrities and a Kinect dataset. Our experimental results and comparisons with existing state-of-the-art methods show that the proposed method consistently achieves the best performance on all these datasets.

Cite

Text

Hayat et al. "Learning Non-Linear Reconstruction Models for Image Set Classification." Conference on Computer Vision and Pattern Recognition, 2014. doi:10.1109/CVPR.2014.246

Markdown

[Hayat et al. "Learning Non-Linear Reconstruction Models for Image Set Classification." Conference on Computer Vision and Pattern Recognition, 2014.](https://mlanthology.org/cvpr/2014/hayat2014cvpr-learning/) doi:10.1109/CVPR.2014.246

BibTeX

@inproceedings{hayat2014cvpr-learning,
  title     = {{Learning Non-Linear Reconstruction Models for Image Set Classification}},
  author    = {Hayat, Munawar and Bennamoun, Mohammed and An, Senjian},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2014},
  doi       = {10.1109/CVPR.2014.246},
  url       = {https://mlanthology.org/cvpr/2014/hayat2014cvpr-learning/}
}