Learning Non-Linear Reconstruction Models for Image Set Classification
Abstract
We propose a deep learning framework for image set classification with application to face recognition. An Adaptive Deep Network Template (ADNT) is defined whose parameters are initialized by performing unsupervised pre-training in a layer-wise fashion using Gaussian Restricted Boltzmann Machines (GRBMs). The pre-initialized ADNT is then separately trained for images of each class and class-specific models are learnt. Based on the minimum reconstruction error from the learnt class-specific models, a majority voting strategy is used for classification. The proposed framework is extensively evaluated for the task of image set classification based face recognition on Honda/UCSD, CMU Mobo, YouTube Celebrities and a Kinect dataset. Our experimental results and comparisons with existing state-of-the-art methods show that the proposed method consistently achieves the best performance on all these datasets.
Cite
Text
Hayat et al. "Learning Non-Linear Reconstruction Models for Image Set Classification." Conference on Computer Vision and Pattern Recognition, 2014. doi:10.1109/CVPR.2014.246Markdown
[Hayat et al. "Learning Non-Linear Reconstruction Models for Image Set Classification." Conference on Computer Vision and Pattern Recognition, 2014.](https://mlanthology.org/cvpr/2014/hayat2014cvpr-learning/) doi:10.1109/CVPR.2014.246BibTeX
@inproceedings{hayat2014cvpr-learning,
title = {{Learning Non-Linear Reconstruction Models for Image Set Classification}},
author = {Hayat, Munawar and Bennamoun, Mohammed and An, Senjian},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2014},
doi = {10.1109/CVPR.2014.246},
url = {https://mlanthology.org/cvpr/2014/hayat2014cvpr-learning/}
}