Towards Robust Multimodal Sentiment Analysis with Incomplete Data

Abstract

The field of Multimodal Sentiment Analysis (MSA) has recently witnessed an emerging direction seeking to tackle the issue of data incompleteness. Recognizing that the language modality typically contains dense sentiment information, we consider it as the dominant modality and present an innovative Language-dominated Noise-resistant Learning Network (LNLN) to achieve robust MSA. The proposed LNLN features a dominant modality correction (DMC) module and dominant modality based multimodal learning (DMML) module, which enhances the model's robustness across various noise scenarios by ensuring the quality of dominant modality representations. Aside from the methodical design, we perform comprehensive experiments under random data missing scenarios, utilizing diverse and meaningful settings on several popular datasets (e.g., MOSI, MOSEI, and SIMS), providing additional uniformity, transparency, and fairness compared to existing evaluations in the literature. Empirically, LNLN consistently outperforms existing baselines, demonstrating superior performance across these challenging and extensive evaluation metrics.

Cite

Text

Zhang et al. "Towards Robust Multimodal Sentiment Analysis with Incomplete Data." Neural Information Processing Systems, 2024. doi:10.52202/079017-1779

Markdown

[Zhang et al. "Towards Robust Multimodal Sentiment Analysis with Incomplete Data." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/zhang2024neurips-robust/) doi:10.52202/079017-1779

BibTeX

@inproceedings{zhang2024neurips-robust,
  title     = {{Towards Robust Multimodal Sentiment Analysis with Incomplete Data}},
  author    = {Zhang, Haoyu and Wang, Wenbin and Yu, Tianshu},
  booktitle = {Neural Information Processing Systems},
  year      = {2024},
  doi       = {10.52202/079017-1779},
  url       = {https://mlanthology.org/neurips/2024/zhang2024neurips-robust/}
}