A Multi-Focus-Driven Multi-Branch Network for Robust Multimodal Sentiment Analysis

Abstract

Multimodal sentiment analysis aims to integrate diverse modalities for precise emotional interpretation. However, external factors such as sensor malfunctions or network issues may disrupt certain modalities. This may lead to missing data, which poses challenges in real-world deployment. Most existing approaches focus on designing feature reconstruction strategies, overlooking the collaborative integration of reconstruction and fusion strategies. Moreover, they fail to capture the relationships between features in the global dimension and those in the local dimension. These limitations hinder the full capture of the complex nature of multimodal data, especially in scenarios involving missing modalities. To address the above issues, this paper proposes a robust model named MFMB-Net with multiple branches for feature multi-focus fusion and reconstruction. We design a two-stream fusion branch where macro-fusion focuses on the fusion of features in the global dimension and micro-fusion targets local dimension features. This dual-stream fusion branch distributes multi-focus across both pathways, simultaneously capturing global coarse-grained and local fine-grained features. Additionally, the reconstruction branch interacts collaboratively with the fusion branch to reconstruct and enhance the missing data. It integrates the reconstructed feature information with the fused information thus refining the representation fidelity of the missing information. Experiments performed on two benchmarks show that our approach obtains results superior to state-of-the-art models.

Cite

Text

Tao et al. "A Multi-Focus-Driven Multi-Branch Network for Robust Multimodal Sentiment Analysis." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I2.32146

Markdown

[Tao et al. "A Multi-Focus-Driven Multi-Branch Network for Robust Multimodal Sentiment Analysis." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/tao2025aaai-multi/) doi:10.1609/AAAI.V39I2.32146

BibTeX

@inproceedings{tao2025aaai-multi,
  title     = {{A Multi-Focus-Driven Multi-Branch Network for Robust Multimodal Sentiment Analysis}},
  author    = {Tao, Chuanqi and Li, Jiaming and Zang, Tianzi and Gao, Peng},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {1547-1555},
  doi       = {10.1609/AAAI.V39I2.32146},
  url       = {https://mlanthology.org/aaai/2025/tao2025aaai-multi/}
}