Beyond Noise: Characterizing Creative Potential in Unverifiable LLM Hallucinations

Yu Yan; Chunhong Zhang; Haiyu Zhao; Ziyang Zeng; ZiHao Liu; Yongkang Wu; Jianzhou Diao; \begin{CJK*}{UTF8}{gbsn}陈奕杰\end{CJK*}; Shujie Wang; Zheng Hu

Beyond Noise: Characterizing Creative Potential in Unverifiable LLM Hallucinations

Yu Yan, Chunhong Zhang, Haiyu Zhao, Ziyang Zeng, Zihao Liu, Yongkang Wu, Jianzhou Diao, \begin{CJK*}{UTF8}{gbsn}陈奕杰\end{CJK*}, Shujie Wang, Zheng Hu

Abstract

In knowledge-intensive creative tasks, Large Language Models (LLMs) often generate outputs that extend beyond established knowledge, making direct verification against current evidence impractical. Unlike factual hallucinations checked against ground truth, such outputs arise naturally in creative generation, where extending beyond current knowledge is often the goal. Yet prior work debates whether hallucination should be suppressed or embraced without empirically analyzing this unverifiable subclass. On the ideation evaluation side, existing work focuses on individual outputs without characterizing the unverifiable space as a whole. To address this gap, we propose a novelty-verifiability characterization that distinguishes Creative Synthesis (Region A) from Groundless Fabrication (Region B), and study it through a conceptual creation task where LLMs synthesize novel scientific concepts. Through 32,400 generations across three technical domains and 1,080 human judgments, we find that Region A is non-negligible (4.7%) and robust, persisting across generation strategies, models, domains, and embedding choices. A retrospective recovery experiment further shows that LLMs can approximate post-cutoff scientific concepts in controlled combinatorial settings. Our findings suggest that the unverifiable space is not uniformly noise but exhibits empirically distinguishable internal structure, providing an empirical basis for more selective hallucination governance.[<https://github.com/YuLab1/llm-concept-creation>]

Anthology ID:: 2026.acl-long.554
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 12102–12124
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.554/
DOI:
Bibkey:
Cite (ACL):: Yu Yan, Chunhong Zhang, Haiyu Zhao, Ziyang Zeng, Zihao Liu, Yongkang Wu, Jianzhou Diao, \begin{CJK*}{UTF8}{gbsn}陈奕杰\end{CJK*}, Shujie Wang, and Zheng Hu. 2026. Beyond Noise: Characterizing Creative Potential in Unverifiable LLM Hallucinations. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 12102–12124, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Beyond Noise: Characterizing Creative Potential in Unverifiable LLM Hallucinations (Yan et al., ACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.554.pdf
Checklist:: 2026.acl-long.554.checklist.pdf

PDF Cite Search Checklist Fix data