Watch Out Your Album! on the Inadvertent Privacy Memorization in Multi-Modal Large Language Models

Abstract

Multi-Modal Large Language Models (MLLMs) have exhibited remarkable performance on various vision-language tasks such as Visual Question Answering (VQA). Despite accumulating evidence of privacy concerns associated with task-relevant content, it remains unclear whether MLLMs inadvertently memorize private content that is entirely irrelevant to the training tasks. In this paper, we investigate how randomly generated task-irrelevant private content can become spuriously correlated with downstream objectives due to partial mini-batch training dynamics, thus causing inadvertent memorization. Concretely, we randomly generate task-irrelevant watermarks into VQA fine-tuning images at varying probabilities and propose a novel probing framework to determine whether MLLMs have inadvertently encoded such content. Our experiments reveal that MLLMs exhibit notably different training behaviors in partial mini-batch settings with task-irrelevant watermarks embedded. Furthermore, through layer-wise probing, we demonstrate that MLLMs trigger distinct representational patterns when encountering previously seen task-irrelevant knowledge, even if this knowledge does not influence their output during prompting. Our code is available at https://github.com/illusionhi/ProbingPrivacy.

Cite

Text

Ju et al. "Watch Out Your Album! on the Inadvertent Privacy Memorization in Multi-Modal Large Language Models." Proceedings of the 42nd International Conference on Machine Learning, 2025.

Markdown

[Ju et al. "Watch Out Your Album! on the Inadvertent Privacy Memorization in Multi-Modal Large Language Models." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/ju2025icml-watch/)

BibTeX

@inproceedings{ju2025icml-watch,
  title     = {{Watch Out Your Album! on the Inadvertent Privacy Memorization in Multi-Modal Large Language Models}},
  author    = {Ju, Tianjie and Hua, Yi and Fei, Hao and Shao, Zhenyu and Zheng, Yubin and Zhao, Haodong and Lee, Mong-Li and Hsu, Wynne and Zhang, Zhuosheng and Liu, Gongshen},
  booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
  year      = {2025},
  pages     = {28446-28462},
  volume    = {267},
  url       = {https://mlanthology.org/icml/2025/ju2025icml-watch/}
}