An Ensemble Approach for Facial Behavior Analysis In-the-Wild Video
Abstract
Human emotions recognization contributes to the development of human-computer interaction. The machines understanding human emotions in the real world will significantly contribute to life in the future. This paper introduces the 3rd Affective Behavior Analysis in-the-wild (ABAW3) 2022 challenge. We focused on solving the problem of the Valence-Arousal (VA) estimation and Action Unit (AU) detection. For valence-arousal estimation, we conducted two stages: creating new features from multimodel and temporal learning to predict valence-arousal. First, we make new features; the Gated Recurrent Unit (GRU) and Transformer are combined using a Regular Networks (RegNet) feature, which is extracted from the image. The next step is the GRU combined with local attention to predict valencearousal. The Concordance Correlation Coefficient (CCC) was used to evaluate the model. The result achieved 0.450 for valence and 0.445 for arousal on the test set, outperforming the baseline method with a corresponding CCC of 0.180 for valence and 0.170 for arousal. We also performed additional experiments on the action unit task with simple transformer blocks. We achieved a score of 49.04 on the test set in terms of F1 score, which outperforms the baseline method with a corresponding F1 score of 36.50. Our submission to ABAW3 2022 ranks 3rd for both tasks.
Cite
Text
Nguyen et al. "An Ensemble Approach for Facial Behavior Analysis In-the-Wild Video." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022. doi:10.1109/CVPRW56347.2022.00281Markdown
[Nguyen et al. "An Ensemble Approach for Facial Behavior Analysis In-the-Wild Video." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022.](https://mlanthology.org/cvprw/2022/nguyen2022cvprw-ensemble/) doi:10.1109/CVPRW56347.2022.00281BibTeX
@inproceedings{nguyen2022cvprw-ensemble,
title = {{An Ensemble Approach for Facial Behavior Analysis In-the-Wild Video}},
author = {Nguyen, Hong Hai and Huynh, Van Thong and Kim, Soo-Hyung},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2022},
pages = {2511-2516},
doi = {10.1109/CVPRW56347.2022.00281},
url = {https://mlanthology.org/cvprw/2022/nguyen2022cvprw-ensemble/}
}