An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos
Abstract
Emotion recognition in user-generated videos plays an important role in human-centered computing. Existing methods mainly employ traditional two-stage shallow pipeline, i.e. extracting visual and/or audio features and training classifiers. In this paper, we propose to recognize video emotions in an end-to-end manner based on convolutional neural networks (CNNs). Specifically, we develop a deep Visual-Audio Attention Network (VAANet), a novel architecture that integrates spatial, channel-wise, and temporal attentions into a visual 3D CNN and temporal attentions into an audio 2D CNN. Further, we design a special classification loss, i.e. polarity-consistent cross-entropy loss, based on the polarity-emotion hierarchy constraint to guide the attention generation. Extensive experiments conducted on the challenging VideoEmotion-8 and Ekman-6 datasets demonstrate that the proposed VAANet outperforms the state-of-the-art approaches for video emotion recognition. Our source code is released at: https://github.com/maysonma/VAANet.
Cite
Text
Zhao et al. "An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos." AAAI Conference on Artificial Intelligence, 2020. doi:10.1609/AAAI.V34I01.5364Markdown
[Zhao et al. "An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos." AAAI Conference on Artificial Intelligence, 2020.](https://mlanthology.org/aaai/2020/zhao2020aaai-end/) doi:10.1609/AAAI.V34I01.5364BibTeX
@inproceedings{zhao2020aaai-end,
title = {{An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos}},
author = {Zhao, Sicheng and Ma, Yunsheng and Gu, Yang and Yang, Jufeng and Xing, Tengfei and Xu, Pengfei and Hu, Runbo and Chai, Hua and Keutzer, Kurt},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2020},
pages = {303-311},
doi = {10.1609/AAAI.V34I01.5364},
url = {https://mlanthology.org/aaai/2020/zhao2020aaai-end/}
}