Improving Multimodal Social Media Popularity Prediction via Selective Retrieval Knowledge Augmentation

Abstract

Understanding and predicting the popularity of online User-Generated Content (UGC) is critical for various social and recommendation systems. Existing efforts have focused on extracting predictive features and using pre-trained deep models to learn and fuse multimodal UGC representations. However, the dissemination of social UGCs is not an isolated process in social network; rather, it is influenced by contextual relevant UGCs and various exogenous factors, including social ties, trends, user interests, and platform algorithms. In this work, we propose a retrieval-based framework to enhance the popularity prediction of multimodal UGCs. Our framework extends beyond a simple semantic retrieval, incorporating a meta retrieval strategy that queries a diverse set of relevant UGCs by considering multimodal content semantics, and metadata from user and post. Moreover, to eliminate irrelevant and noisy UGCs in retrieval, we introduce a new measure called Relative Retrieval Contribution to Prediction (RRCP), which selectively refines the retrieved UGCs. We then aggregate the contextual UGC knowledge using vision-language graph neural networks, and fuse them with an RRCP-Attention-based prediction network. Extensive experiments on three large-scale social media datasets demonstrate significant improvements ranging from 26.68% to 48.19% across all metrics compared to strong baselines.

Cite

Text

Xu et al. "Improving Multimodal Social Media Popularity Prediction via Selective Retrieval Knowledge Augmentation." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I1.32078

Markdown

[Xu et al. "Improving Multimodal Social Media Popularity Prediction via Selective Retrieval Knowledge Augmentation." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/xu2025aaai-improving/) doi:10.1609/AAAI.V39I1.32078

BibTeX

@inproceedings{xu2025aaai-improving,
  title     = {{Improving Multimodal Social Media Popularity Prediction via Selective Retrieval Knowledge Augmentation}},
  author    = {Xu, Xovee and Zhang, Yifan and Zhou, Fan and Song, Jingkuan},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {932-940},
  doi       = {10.1609/AAAI.V39I1.32078},
  url       = {https://mlanthology.org/aaai/2025/xu2025aaai-improving/}
}