Multi-Task Learning Framework for Emotion Recognition In-the-Wild

Zhang, Tenggan; Liu, Chuanhe; Liu, Xiaolong; Liu, Yuchen; Meng, Liyu; Sun, Lei; Jiang, Wenqiang; Zhang, Fengyuan; Zhao, Jinming; Jin, Qin

doi:10.1007/978-3-031-25075-0_11

ECCVW 2022 pp. 143-156

Abstract

This paper presents our system for the Multi-Task Learning (MTL) Challenge in the 4th Affective Behavior Analysis in-the-wild (ABAW) competition. We address the challenge from three aspects: 1) To obtain efficient and robust visual feature representations, we propose MAE-based unsupervised representation learning and IResNet/DenseNet-based supervised representation learning methods; 2) Because temporal information is important in videos, we explore three types of sequential encoders to capture it: Transformer-based, LSTM-based, and GRU-based encoders; 3) To model the correlations between the tasks in multi-task affective analysis (i.e., valence, arousal, expression, and action units (AU)), we first explore the dependencies between these tasks and then propose three multi-task learning frameworks that model the correlations effectively. Our system achieves a score of $1.7607$ on the validation set and $1.4361$ on the test set, ranking first in the MTL Challenge. The code is available at https://github.com/AIM3-RUC/ABAW4.
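The abstract reports a single overall score but does not say how it is computed. The sketch below is one plausible reading, not the authors' evaluation code: it assumes the score is the sum of three terms, namely the mean concordance correlation coefficient (CCC) over valence and arousal, the macro-averaged F1 over expression classes, and the F1 averaged over AUs. The function names (`ccc`, `macro_f1`, `au_f1`, `mtl_score`) and the input layout are hypothetical and chosen only for illustration.

```python
from statistics import fmean


def ccc(pred, gold):
    """Concordance correlation coefficient of two equal-length sequences."""
    mp, mg = fmean(pred), fmean(gold)
    var_p = fmean([(p - mp) ** 2 for p in pred])
    var_g = fmean([(g - mg) ** 2 for g in gold])
    cov = fmean([(p - mp) * (g - mg) for p, g in zip(pred, gold)])
    denom = var_p + var_g + (mp - mg) ** 2
    return 2 * cov / denom if denom else 0.0


def f1_binary(pred, gold):
    """F1 for binary (0/1 or bool) labels; 0.0 when there are no positives at all."""
    tp = sum(1 for p, g in zip(pred, gold) if p and g)
    fp = sum(1 for p, g in zip(pred, gold) if p and not g)
    fn = sum(1 for p, g in zip(pred, gold) if not p and g)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def macro_f1(pred, gold, num_classes):
    """Unweighted mean of one-vs-rest F1 over expression classes 0..num_classes-1."""
    return fmean(
        [f1_binary([p == c for p in pred], [g == c for g in gold])
         for c in range(num_classes)]
    )


def au_f1(pred_rows, gold_rows):
    """Mean F1 over AU columns; each row holds one 0/1 flag per AU for a frame."""
    num_aus = len(gold_rows[0])
    return fmean(
        [f1_binary([r[k] for r in pred_rows], [r[k] for r in gold_rows])
         for k in range(num_aus)]
    )


def mtl_score(va_pred, va_gold, expr_pred, expr_gold, au_pred, au_gold, num_classes):
    """Assumed overall score: mean CCC (valence, arousal) + expression F1 + AU F1."""
    ccc_v = ccc([v for v, _ in va_pred], [v for v, _ in va_gold])
    ccc_a = ccc([a for _, a in va_pred], [a for _, a in va_gold])
    return (0.5 * (ccc_v + ccc_a)
            + macro_f1(expr_pred, expr_gold, num_classes)
            + au_f1(au_pred, au_gold))
```

Under this assumption each of the three terms is at most 1, so the overall score is at most 3, which is at least consistent with the reported values lying between 1 and 2.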

Cite

Text

Zhang et al. "Multi-Task Learning Framework for Emotion Recognition In-the-Wild." European Conference on Computer Vision Workshops, 2022. doi:10.1007/978-3-031-25075-0_11

Markdown

[Zhang et al. "Multi-Task Learning Framework for Emotion Recognition In-the-Wild." European Conference on Computer Vision Workshops, 2022.](https://mlanthology.org/eccvw/2022/zhang2022eccvw-multitask/) doi:10.1007/978-3-031-25075-0_11

BibTeX

@inproceedings{zhang2022eccvw-multitask,
  title     = {{Multi-Task Learning Framework for Emotion Recognition In-the-Wild}},
  author    = {Zhang, Tenggan and Liu, Chuanhe and Liu, Xiaolong and Liu, Yuchen and Meng, Liyu and Sun, Lei and Jiang, Wenqiang and Zhang, Fengyuan and Zhao, Jinming and Jin, Qin},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2022},
  pages     = {143-156},
  doi       = {10.1007/978-3-031-25075-0_11},
  url       = {https://mlanthology.org/eccvw/2022/zhang2022eccvw-multitask/}
}