Zero-knowledge LLM hallucination detection and mitigation through fine-grained cross-model consistency

Aman Goel, Daniel Schwartz, Yanjun Qi


Abstract
Large language models (LLMs) have demonstrated impressive capabilities across diverse tasks, but they remain susceptible to hallucinations: generating content that appears plausible but contains factual inaccuracies. We present Finch-Zk, a black-box framework that leverages fine-grained cross-model consistency to detect and mitigate hallucinations in LLM outputs without requiring external knowledge sources. Finch-Zk introduces two key innovations: 1) a cross-model consistency checking strategy that reveals fine-grained inaccuracies by comparing responses generated by diverse models from semantically equivalent prompts, and 2) a targeted mitigation technique that applies precise corrections to problematic segments while preserving accurate content. Experiments on the FELM dataset show that Finch-Zk improves hallucination detection F1 scores by 6–39% compared to existing approaches. For mitigation, Finch-Zk achieves up to a 9-percentage-point absolute improvement in answer accuracy on the GPQA-diamond dataset when applied to state-of-the-art models such as Llama 4 Maverick and Claude 4 Sonnet. Extensive evaluation on multiple datasets demonstrates that Finch-Zk provides a practical, deployment-ready safeguard for enhancing factual reliability in production LLM systems.
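
The abstract only sketches the two components at a high level. The Python fragment below is a minimal illustrative sketch of that general shape (segment-level cross-model consistency scoring followed by targeted rewriting), not the paper's actual implementation; every function name, the paraphrasing step, the judge interface, and the 0.5 threshold are assumptions introduced for illustration.

    # Illustrative sketch only: names, signatures, and the threshold are
    # assumptions, not the Finch-Zk implementation described in the paper.
    from typing import Callable, List

    def split_into_segments(response: str) -> List[str]:
        """Naively split a response into sentence-level segments."""
        return [s.strip() for s in response.split(".") if s.strip()]

    def consistency_score(segment: str, peer_responses: List[str],
                          judge: Callable[[str, str], float]) -> float:
        """Average support for `segment` across other models' responses.

        `judge(premise, claim)` is assumed to return a support score in
        [0, 1], e.g. from an NLI model or an LLM-as-judge call.
        """
        scores = [judge(peer, segment) for peer in peer_responses]
        return sum(scores) / len(scores) if scores else 0.0

    def detect_and_mitigate(prompt: str,
                            primary_model: Callable[[str], str],
                            peer_models: List[Callable[[str], str]],
                            paraphrase: Callable[[str], str],
                            judge: Callable[[str, str], float],
                            rewrite_segment: Callable[[str, str], str],
                            threshold: float = 0.5) -> str:
        """Flag low-consistency segments and rewrite only those segments."""
        # 1) Primary answer, plus peer answers to semantically equivalent prompts.
        answer = primary_model(prompt)
        peer_answers = [m(paraphrase(prompt)) for m in peer_models]

        # 2) Fine-grained detection: score each segment against peer answers.
        repaired = []
        for seg in split_into_segments(answer):
            if consistency_score(seg, peer_answers, judge) < threshold:
                # 3) Targeted mitigation: correct only the flagged segment,
                #    leaving well-supported segments untouched.
                seg = rewrite_segment(prompt, seg)
            repaired.append(seg)
        return ". ".join(repaired) + "."

A caller would supply the model wrappers, the paraphraser, the judge, and the segment rewriter; the value of the fine-grained formulation is that only segments with low cross-model support are regenerated, so accurate content in the original answer is preserved.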
Anthology ID:
2025.emnlp-industry.139
Volume:
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track
Month:
November
Year:
2025
Address:
Suzhou (China)
Editors:
Saloni Potdar, Lina Rojas-Barahona, Sebastien Montella
Venue:
EMNLP
Publisher:
Association for Computational Linguistics
Pages:
1982–1999
URL:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-industry.139/
Cite (ACL):
Aman Goel, Daniel Schwartz, and Yanjun Qi. 2025. Zero-knowledge LLM hallucination detection and mitigation through fine-grained cross-model consistency. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track, pages 1982–1999, Suzhou (China). Association for Computational Linguistics.
Cite (Informal):
Zero-knowledge LLM hallucination detection and mitigation through fine-grained cross-model consistency (Goel et al., EMNLP 2025)
PDF:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-industry.139.pdf