[2505.17064] Synthetic History: Evaluating Visual Representations of the Past in Diffusion Models
Summary
This article evaluates how Text-to-Image diffusion models represent historical contexts, introducing a benchmark to assess their accuracy and biases.
Why It Matters
As diffusion models gain prominence in content creation, understanding their representation of historical contexts is crucial for addressing biases and inaccuracies. This research provides a framework for evaluating these models, promoting more responsible and accurate use of AI in historical imagery.
Key Takeaways
- Introduces a benchmark for evaluating historical representation in TTI models.
- Identifies systematic inaccuracies in generated historical imagery.
- Highlights the need for improved demographic representation in AI-generated content.
- Examines implicit stylistic associations and historical consistency.
- Encourages further research into the societal implications of AI-generated imagery.
Computer Science > Computer Vision and Pattern Recognition arXiv:2505.17064 (cs) [Submitted on 18 May 2025 (v1), last revised 20 Feb 2026 (this version, v3)] Title:Synthetic History: Evaluating Visual Representations of the Past in Diffusion Models Authors:Maria-Teresa De Rosa Palmini, Eva Cetinic View a PDF of the paper titled Synthetic History: Evaluating Visual Representations of the Past in Diffusion Models, by Maria-Teresa De Rosa Palmini and Eva Cetinic View PDF HTML (experimental) Abstract:As Text-to-Image (TTI) diffusion models become increasingly influential in content creation, growing attention is being directed toward their societal and cultural implications. While prior research has primarily examined demographic and cultural biases, the ability of these models to accurately represent historical contexts remains largely underexplored. To address this gap, we introduce a benchmark for evaluating how TTI models depict historical contexts. The benchmark combines HistVis, a dataset of 30,000 synthetic images generated by three state-of-the-art diffusion models from carefully designed prompts covering universal human activities across multiple historical periods, with a reproducible evaluation protocol. We evaluate generated imagery across three key aspects: (1) Implicit Stylistic Associations: examining default visual styles associated with specific eras; (2) Historical Consistency: identifying anachronisms such as modern artifacts in pre-modern contexts; and (3) ...