Exploiting Sparsity and Statistical Dependence in Multivariate Data Fusion: An Application to Misinformation Detection for High-Impact Events

Abstract

With the evolution of social media, cyberspace has become the de-facto medium for users to communicate during high-impact events such as natural disasters, terrorist attacks, and periods of political unrest. However, during such high-impact events, misinformation can spread rapidly on social media, affecting decision-making and creating social unrest. Identifying the spread of misinformation during high-impact events is a significant data challenge, given the multi-modal data associated with social media posts. Advances in multi-modal learning have shown promise for detecting misinformation; however, key limitations still make this a significant challenge. These limitations include the explicit and efficient modeling of the underlying non-linear associations of multi-modal data geared at misinformation detection. This paper presents a novel avenue of work that demonstrates how to frame the problem of misinformation detection in social media using multi-modal latent variable modeling and presents two novel algorithms capable of modeling the underlying associations of multi-modal data. We demonstrate the effectiveness of the proposed algorithms using simulated data and study their performance in the context of misinformation detection using a popular multi-modal dataset that consists of tweets published during several high-impact events.

Cite

Text

Damasceno et al. "Exploiting Sparsity and Statistical Dependence in Multivariate Data Fusion: An Application to Misinformation Detection for High-Impact Events." Machine Learning, 2024. doi:10.1007/S10994-023-06424-8

Markdown

[Damasceno et al. "Exploiting Sparsity and Statistical Dependence in Multivariate Data Fusion: An Application to Misinformation Detection for High-Impact Events." Machine Learning, 2024.](https://mlanthology.org/mlj/2024/damasceno2024mlj-exploiting/) doi:10.1007/S10994-023-06424-8

BibTeX

@article{damasceno2024mlj-exploiting,
  title     = {{Exploiting Sparsity and Statistical Dependence in Multivariate Data Fusion: An Application to Misinformation Detection for High-Impact Events}},
  author    = {Damasceno, Lucas P. and Rexhepi, Egzona and Shafer, Allison and Whitehouse, Ian and Japkowicz, Nathalie and Cavalcante, Charles C. and Corizzo, Roberto and Boukouvalas, Zois},
  journal   = {Machine Learning},
  year      = {2024},
  pages     = {2183-2205},
  doi       = {10.1007/S10994-023-06424-8},
  volume    = {113},
  url       = {https://mlanthology.org/mlj/2024/damasceno2024mlj-exploiting/}
}