Image-Guided Topic Modeling for Interpretable Privacy Classification

Abstract

Predicting and explaining the private information contained in an image in human-understandable terms is a complex and contextual task. This task is challenging even for large language models. To facilitate the understanding of privacy decisions, we propose to predict image privacy based on a set of natural language content descriptors. These content descriptors are associated with privacy scores that reflect how people perceive image content. We generate descriptors with our novel Image-guided Topic Modeling (ITM) approach. ITM leverages, via multimodality alignment, both vision information and image textual descriptions from a vision language model. We use the ITM-generated descriptors to learn a privacy predictor, Priv$\times$ITM, whose decisions are interpretable by design. Our Priv$\times$ITM classifier outperforms the reference interpretable method by 5 percentage points in accuracy and performs comparably to the current non-interpretable state-of-the-art model.

Cite

Text

Baia and Cavallaro. "Image-Guided Topic Modeling for Interpretable Privacy Classification." European Conference on Computer Vision Workshops, 2024. doi:10.1007/978-3-031-92648-8_13

Markdown

[Baia and Cavallaro. "Image-Guided Topic Modeling for Interpretable Privacy Classification." European Conference on Computer Vision Workshops, 2024.](https://mlanthology.org/eccvw/2024/baia2024eccvw-imageguided/) doi:10.1007/978-3-031-92648-8_13

BibTeX

@inproceedings{baia2024eccvw-imageguided,
  title     = {{Image-Guided Topic Modeling for Interpretable Privacy Classification}},
  author    = {Baia, Alina Elena and Cavallaro, Andrea},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2024},
  pages     = {200-217},
  doi       = {10.1007/978-3-031-92648-8_13},
  url       = {https://mlanthology.org/eccvw/2024/baia2024eccvw-imageguided/}
}