Towards Multimodal Sentiment Analysis via Hierarchical Correlation Modeling with Semantic Distribution Constraints
Abstract
Sentiment analysis is rapidly advancing by utilizing various data modalities (e.g., text, video, and audio). However, most existing techniques only learn the atomic-level features that reflect strong correlations, while ignoring more complex compositions in multimodal data. Moreover, they also neglected the incongruity in semantic distribution among modalities. In light of this, we introduce a novel Hierarchical Correlation Modeling Network (HCMNet), which enhances the multimodal sentiment analysis by exploring both the atomic-level correlations based on dynamic attention reasoning and the composition-level correlations through topological graph reasoning. In addition, we also alleviate the impact of distributional inconsistencies between modalities from both atomic-level and composition-level perspectives. Specifically, we first design an atomic-level contrastive loss that constrains the semantic distribution across modalities to mitigate the atomic-level inconsistency. Then, we design a graph optimal transport module that integrates transport flows with different graphs to constrain the composition-level semantic distribution, thus reducing the inconsistency of compositional nodes. Experiments on three public benchmark datasets have demonstrated the superiority of the proposed model over the state-of-the-art methods.
Cite
Text
Xu et al. "Towards Multimodal Sentiment Analysis via Hierarchical Correlation Modeling with Semantic Distribution Constraints." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I20.35484Markdown
[Xu et al. "Towards Multimodal Sentiment Analysis via Hierarchical Correlation Modeling with Semantic Distribution Constraints." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/xu2025aaai-multimodal/) doi:10.1609/AAAI.V39I20.35484BibTeX
@inproceedings{xu2025aaai-multimodal,
title = {{Towards Multimodal Sentiment Analysis via Hierarchical Correlation Modeling with Semantic Distribution Constraints}},
author = {Xu, Qinfu and Wei, Yiwei and Wu, Chunlei and Wang, Leiquan and Yuan, Shaozu and Wu, Jie and Lu, Jing and Zhou, Hengyang},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {21788-21796},
doi = {10.1609/AAAI.V39I20.35484},
url = {https://mlanthology.org/aaai/2025/xu2025aaai-multimodal/}
}