@inproceedings{qiu-hu-2025-deep,
    title = "Deep Associations, High Creativity: A Simple yet Effective Metric for Evaluating Large Language Models",
    author = "Qiu, Ziliang  and
      Hu, Renfen",
    editor = "Christodoulopoulos, Christos  and
      Chakraborty, Tanmoy  and
      Rose, Carolyn  and
      Peng, Violet",
    booktitle = "Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing",
    month = nov,
    year = "2025",
    address = "Suzhou, China",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.550/",
    pages = "10870--10883",
    ISBN = "979-8-89176-332-6",
    abstract = "The evaluation of LLMs' creativity represents a crucial research domain, though challenges such as data contamination and costly human assessments often impede progress. Drawing inspiration from human creativity assessment, we propose PACE, asking LLMs to generate Parallel Chains of Associations to Evaluate their creativity. PACE minimizes the risk of data contamination and offers a straightforward, highly efficient evaluation, as evidenced by its strong correlation with Arena Creative Writing (Spearman{'}s $\rho = 0.739$, $p < 0.001$) on various proprietary and open-source models. A comparative analysis of associative creativity between LLMs and humans reveals that while high-performing LLMs achieve scores comparable to average human performance, top-performing humans consistently outperform LLMs. Furthermore, linguistic analysis reveals that both humans and LLMs exhibit a trend of decreasing concreteness in their associations, with humans demonstrating a greater diversity of associative patterns."
}