TempViz: On the Evaluation of Temporal Knowledge in Text-to-Image Models

Carolin Holtermann; Nina Krebs; Anne Lauscher

TempViz: On the Evaluation of Temporal Knowledge in Text-to-Image Models

Carolin Holtermann, Nina Krebs, Anne Lauscher

Abstract

Time alters the visual appearance of entities in our world, like objects, places, and animals. Thus, for accurately generating contextually-relevant images, knowledge and reasoning about time can be crucial (e.g., for generating a landscape in spring vs. in winter). Yet, although substantial work exists on understanding and improving temporal knowledge in natural language processing, research on how temporal phenomena appear and are handled in text-to-image (T2I) models remains scarce. We address this gap with TempViz, the first data set to holistically evaluate temporal knowledge in image generation, consisting of 7.9k prompts and more than 600 reference images. Using TempViz, we study the capabilities of five T2I models across five temporal knowledge categories. Human evaluation shows that temporal competence is generally weak, with no model exceeding 75% accuracy across categories. Towards larger-scale studies, we also examine automated evaluation methods, comparing several established approaches against human judgments. However, none of these approaches provides a reliable assessment of temporal cues - further indicating the pressing need for future research on temporal knowledge in T2I.

Anthology ID:: 2026.eacl-long.187
Volume:: Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: March
Year:: 2026
Address:: Rabat, Morocco
Editors:: Vera Demberg, Kentaro Inui, Lluís Marquez
Venue:: EACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 4006–4028
Language:
URL:: https://preview.aclanthology.org/ingest-eacl/2026.eacl-long.187/
DOI:
Bibkey:
Cite (ACL):: Carolin Holtermann, Nina Krebs, and Anne Lauscher. 2026. TempViz: On the Evaluation of Temporal Knowledge in Text-to-Image Models. In Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pages 4006–4028, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):: TempViz: On the Evaluation of Temporal Knowledge in Text-to-Image Models (Holtermann et al., EACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-eacl/2026.eacl-long.187.pdf

PDF Cite Search Fix data