Deep Bimodal Regression for Apparent Personality Analysis

Abstract

Apparent personality analysis from short video sequences is a challenging problem in computer vision and multimedia research. In order to capture rich information from both the visual and audio modality of videos, we propose the Deep Bimodal Regression (DBR) framework. In DBR, for the visual modality, we modify the traditional convolutional neural networks for exploiting important visual cues. In addition, taking into account the model efficiency, we extract audio representations and build the linear regressor for the audio modality. For combining the complementary information from the two modalities, we ensemble these predicted regression scores by both early fusion and late fusion. Finally, based on the proposed framework, we come up with a solution for the Apparent Personality Analysis competition track in the ChaLearn Looking at People challenge in association with ECCV 2016. Our DBR is the winner (first place) of this challenge with 86 registered teams.

Cite

Text

Zhang et al. "Deep Bimodal Regression for Apparent Personality Analysis." European Conference on Computer Vision Workshops, 2016. doi:10.1007/978-3-319-49409-8_25

Markdown

[Zhang et al. "Deep Bimodal Regression for Apparent Personality Analysis." European Conference on Computer Vision Workshops, 2016.](https://mlanthology.org/eccvw/2016/zhang2016eccvw-deep/) doi:10.1007/978-3-319-49409-8_25

BibTeX

@inproceedings{zhang2016eccvw-deep,
  title     = {{Deep Bimodal Regression for Apparent Personality Analysis}},
  author    = {Zhang, Chen-Lin and Zhang, Hao and Wei, Xiu-Shen and Wu, Jianxin},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2016},
  pages     = {311-324},
  doi       = {10.1007/978-3-319-49409-8_25},
  url       = {https://mlanthology.org/eccvw/2016/zhang2016eccvw-deep/}
}