Watch Out Your Album! on the Inadvertent Privacy Memorization in Multi-Modal Large Language Models
Abstract
Multi-Modal Large Language Models (MLLMs) have exhibited remarkable performance on various vision-language tasks such as Visual Question Answering (VQA). Despite accumulating evidence of privacy concerns associated with task-relevant content, it remains unclear whether MLLMs inadvertently memorize private content that is entirely irrelevant to the training tasks. In this paper, we investigate how randomly generated task-irrelevant private content can become spuriously correlated with downstream objectives due to partial mini-batch training dynamics, thus causing inadvertent memorization. Concretely, we randomly generate task-irrelevant watermarks into VQA fine-tuning images at varying probabilities and propose a novel probing framework to determine whether MLLMs have inadvertently encoded such content. Our experiments reveal that MLLMs exhibit notably different training behaviors in partial mini-batch settings with task-irrelevant watermarks embedded. Furthermore, through layer-wise probing, we demonstrate that MLLMs trigger distinct representational patterns when encountering previously seen task-irrelevant knowledge, even if this knowledge does not influence their output during prompting. Our code is available at https://github.com/illusionhi/ProbingPrivacy.
Cite
Text
Ju et al. "Watch Out Your Album! on the Inadvertent Privacy Memorization in Multi-Modal Large Language Models." Proceedings of the 42nd International Conference on Machine Learning, 2025.Markdown
[Ju et al. "Watch Out Your Album! on the Inadvertent Privacy Memorization in Multi-Modal Large Language Models." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/ju2025icml-watch/)BibTeX
@inproceedings{ju2025icml-watch,
title = {{Watch Out Your Album! on the Inadvertent Privacy Memorization in Multi-Modal Large Language Models}},
author = {Ju, Tianjie and Hua, Yi and Fei, Hao and Shao, Zhenyu and Zheng, Yubin and Zhao, Haodong and Lee, Mong-Li and Hsu, Wynne and Zhang, Zhuosheng and Liu, Gongshen},
booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
year = {2025},
pages = {28446-28462},
volume = {267},
url = {https://mlanthology.org/icml/2025/ju2025icml-watch/}
}