POSTER: A Pyramid Cross-Fusion Transformer Network for Facial Expression Recognition

Abstract

Facial expression recognition (FER) is an important task in computer vision, having practical applications in areas such as human-computer interaction, education, health-care, and online monitoring. In this challenging FER task, there are three key issues especially prevalent: inter-class similarity, intra-class discrepancy, and scale sensitivity. While existing works typically address some of these issues, none have fully addressed all three challenges in a unified framework. In this paper, we propose a two-stream Pyramid crOss-fuSion TransformER network (POSTER), that aims to holistically solve all three issues. Specifically, we design a transformer-based cross-fusion method that enables effective collaboration of facial landmark features and image features to maximize proper attention to salient facial regions. Furthermore, POSTER employs a pyramid structure to promote scale invariance. Extensive experimental results demonstrate that our POSTER achieves new state-of-the-art results on RAF-DB (92.05%), FERPlus (91.62%), as well as AffectNet 7 class (67.31%) and 8 class (63.34%). Code is available at https://github.com/zczcwh/POSTER.

Cite

Text

Zheng et al. "POSTER: A Pyramid Cross-Fusion Transformer Network for Facial Expression Recognition." IEEE/CVF International Conference on Computer Vision Workshops, 2023. doi:10.1109/ICCVW60793.2023.00339

Markdown

[Zheng et al. "POSTER: A Pyramid Cross-Fusion Transformer Network for Facial Expression Recognition." IEEE/CVF International Conference on Computer Vision Workshops, 2023.](https://mlanthology.org/iccvw/2023/zheng2023iccvw-poster/) doi:10.1109/ICCVW60793.2023.00339

BibTeX

@inproceedings{zheng2023iccvw-poster,
  title     = {{POSTER: A Pyramid Cross-Fusion Transformer Network for Facial Expression Recognition}},
  author    = {Zheng, Ce and Mendieta, Matías and Chen, Chen},
  booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
  year      = {2023},
  pages     = {3138-3147},
  doi       = {10.1109/ICCVW60793.2023.00339},
  url       = {https://mlanthology.org/iccvw/2023/zheng2023iccvw-poster/}
}