Evaluation of Text-to-Image Generation from a Creativity Perspective

Xinhao Wang, Xinyu Ma, ShengYong Ding, Derek F. Wong


Abstract
In recent years, driven by advancements in the diffusion process, Text-to-Image (T2I) models have rapidly developed. However, evaluating T2I models remains a significant challenge. While previous research has thoroughly assessed the quality of generated images and image-text alignment, there has been little study on the creativity of these models. In this work, we defined the creativity of T2I models, inspired by previous definitions of machine creativity. We also proposed corresponding metrics and designed a method to test the reliability of the metric. Additionally, we developed a fully automated pipeline capable of transforming existing image-text datasets into benchmarks tailored for evaluating creativity, specifically through text vector retrieval and the text generation capabilities of large language models (LLMs). Finally, we conducted a series of tests and analyses on the evaluation methods for T2I model creativity and the factors influencing the creativity of the models, revealing that current T2I models demonstrate a lack of creativity. The code and benchmark will be released.
Anthology ID:
2025.findings-emnlp.26
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2025
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
481–493
Language:
URL:
https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.26/
DOI:
10.18653/v1/2025.findings-emnlp.26
Bibkey:
Cite (ACL):
Xinhao Wang, Xinyu Ma, ShengYong Ding, and Derek F. Wong. 2025. Evaluation of Text-to-Image Generation from a Creativity Perspective. In Findings of the Association for Computational Linguistics: EMNLP 2025, pages 481–493, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
Evaluation of Text-to-Image Generation from a Creativity Perspective (Wang et al., Findings 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.26.pdf
Checklist:
 2025.findings-emnlp.26.checklist.pdf