Bi-Modal First Impressions Recognition Using Temporally Ordered Deep Audio and Stochastic Visual Features

Abstract

We propose a novel approach to First Impressions Recognition, in terms of the Big Five personality traits, from short videos. The Big Five model describes human personality using five broad categories: Extraversion, Agreeableness, Conscientiousness, Neuroticism, and Openness. We train two bi-modal end-to-end deep neural network architectures using temporally ordered audio features and novel stochastic visual features from a few frames, without over-fitting. We show empirically that the trained models perform exceptionally well even when trained on small sub-portions of the inputs. Our method was evaluated in the ChaLearn LAP 2016 Apparent Personality Analysis (APA) competition on the ChaLearn LAP APA2016 dataset and achieved excellent performance.

Cite

Text

Subramaniam et al. "Bi-Modal First Impressions Recognition Using Temporally Ordered Deep Audio and Stochastic Visual Features." European Conference on Computer Vision Workshops, 2016. doi:10.1007/978-3-319-49409-8_27

Markdown

[Subramaniam et al. "Bi-Modal First Impressions Recognition Using Temporally Ordered Deep Audio and Stochastic Visual Features." European Conference on Computer Vision Workshops, 2016.](https://mlanthology.org/eccvw/2016/subramaniam2016eccvw-bimodal/) doi:10.1007/978-3-319-49409-8_27

BibTeX

@inproceedings{subramaniam2016eccvw-bimodal,
  title     = {{Bi-Modal First Impressions Recognition Using Temporally Ordered Deep Audio and Stochastic Visual Features}},
  author    = {Subramaniam, Arulkumar and Patel, Vismay and Mishra, Ashish and Balasubramanian, Prashanth and Mittal, Anurag},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2016},
  pages     = {337--348},
  doi       = {10.1007/978-3-319-49409-8_27},
  url       = {https://mlanthology.org/eccvw/2016/subramaniam2016eccvw-bimodal/}
}