Where Confabulation Lives: Latent Feature Discovery in LLMs

Thibaud Ardoin, Yi Cai, Gerhard Wunder


Abstract
Hallucination remains a critical failure mode of large language models (LLMs), undermining their trustworthiness in real-world applications. In this work, we focus on confabulation, a foundational aspect of hallucination where the model fabricates facts about unknown entities. We introduce a targeted dataset designed to isolate and analyze this behavior across diverse prompt types. Using this dataset, and building on recent progress in interpreting LLM internals, we extract latent directions associated with confabulation using sparse projections. A simple vector-based steering method demonstrates that these directions can modulate model behavior with minimal disruption, shedding light on the inner representations that drive factual and non-factual output. Our findings contribute to a deeper mechanistic understanding of LLMs and pave the way toward more trustworthy and controllable generation. We release the code and dataset at https://github.com/Thibaud-Ardoin/where-confabulation-lives.
Anthology ID:
2025.emnlp-main.1515
Volume:
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
29801–29825
Language:
URL:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1515/
DOI:
Bibkey:
Cite (ACL):
Thibaud Ardoin, Yi Cai, and Gerhard Wunder. 2025. Where Confabulation Lives: Latent Feature Discovery in LLMs. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 29801–29825, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
Where Confabulation Lives: Latent Feature Discovery in LLMs (Ardoin et al., EMNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1515.pdf
Checklist:
 2025.emnlp-main.1515.checklist.pdf