Fusion of Granular-Ball Visual Spatial Representations for Enhanced Facial Expression Recognition

Abstract

Facial Expression Recognition (FER) is a fundamental problem in computer vision. Despite recent advances, significant challenges remain. Current methods primarily focus on extracting visual representations while overlooking other valuable information. To address this limitation, we propose a novel method called Component Separation and Granular-ball Space Bootstrap Fusion (CS-GBSBF), which leverages granular balls to transform visual images to spatial graphs, thereby enlarging the spatial information embedded in images. Our method separates the face into different components and utilizes the spatial information to bootstrap the fusion. More specifically, CS-GBSBF mainly consists of three crucial networks: Represent Extraction Network (REN), Represent Separation Network (RSN) and Represent Fusion Network (RFN). First, granular balls are used to represent expression images as graphs, which are fed into REN along with images. Then, RSN separates basic visual/spatial representations extracted from REN into a set of component visual/spatial representations. Next, RFN utilizes spatial representations to bootstrap component visual integration. A significant challenge in two-stream models is feature alignment, for which we have developed Attention Guidance Module (AGM) and Bootstrap Alignment Loss (L_BA) in REN and RFN, respectively. Results of experiment on eight databases show that CS-GBSBF consistently achieves higher recognition accuracy than several state-of-the-art methods. The code is available at https://github.com/Lsy235/CS-GBSBF.

Cite

Text

Liu et al. "Fusion of Granular-Ball Visual Spatial Representations for Enhanced Facial Expression Recognition." International Joint Conference on Artificial Intelligence, 2025. doi:10.24963/IJCAI.2025/178

Markdown

[Liu et al. "Fusion of Granular-Ball Visual Spatial Representations for Enhanced Facial Expression Recognition." International Joint Conference on Artificial Intelligence, 2025.](https://mlanthology.org/ijcai/2025/liu2025ijcai-fusion/) doi:10.24963/IJCAI.2025/178

BibTeX

@inproceedings{liu2025ijcai-fusion,
  title     = {{Fusion of Granular-Ball Visual Spatial Representations for Enhanced Facial Expression Recognition}},
  author    = {Liu, Shuaiyu and Shen, Qiyao and Wang, Yunxi and Ren, Yazhou and Wang, Guoyin},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {1594-1602},
  doi       = {10.24963/IJCAI.2025/178},
  url       = {https://mlanthology.org/ijcai/2025/liu2025ijcai-fusion/}
}