Century: A Framework and Dataset for Evaluating Historical Contextualisation of Sensitive Images

Abstract

How do multi-modal generative models describe images of recent historical events and figures, whose legacies may be nuanced, multifaceted, or contested? This task necessitates not only accurate visual recognition, but also socio-cultural knowledge and cross-modal reasoning. To address this evaluation challenge, we introduce Century -- a novel dataset of sensitive historical images. This dataset consists of 1,500 images from recent history, created through an automated method combining knowledge graphs and language models with quality and diversity criteria created from the practices of museums and digital archives. We demonstrate through automated and human evaluation that this method produces a set of images that depict events and figures that are diverse across topics and represents all regions of the world. We additionally propose an evaluation framework for evaluating the historical contextualisation capabilities along dimensions of accuracy, thoroughness, and objectivity. We demonstrate this approach by using Century to evaluate four foundation models, scoring performance using both automated and human evaluation. We find that historical contextualisation of sensitive images poses a significant challenge for modern multi-modal foundation models, and offer practical recommendations for how developers can use Century to evaluate improvements to models and applications.

Cite

Text

Akbulut et al. "Century: A Framework and Dataset for Evaluating Historical Contextualisation of Sensitive Images." International Conference on Learning Representations, 2025.

Markdown

[Akbulut et al. "Century: A Framework and Dataset for Evaluating Historical Contextualisation of Sensitive Images." International Conference on Learning Representations, 2025.](https://mlanthology.org/iclr/2025/akbulut2025iclr-century/)

BibTeX

@inproceedings{akbulut2025iclr-century,
  title     = {{Century: A Framework and Dataset for Evaluating Historical Contextualisation of Sensitive Images}},
  author    = {Akbulut, Canfer and Robinson, Kevin and Rauh, Maribeth and Albuquerque, Isabela and Wiles, Olivia and Weidinger, Laura and Rieser, Verena and Hasson, Yana and Marchal, Nahema and Gabriel, Iason and Isaac, William and Hendricks, Lisa Anne},
  booktitle = {International Conference on Learning Representations},
  year      = {2025},
  url       = {https://mlanthology.org/iclr/2025/akbulut2025iclr-century/}
}