Detecting Hallucinations in Retrieval-Augmented Generation via Semantic-level Internal Reasoning Graph

Jianpeng Hu, Yanzeng Li, Jialun Zhong, Lei Zou, Wenfa Qi


Abstract
The Retrieval-augmented generation (RAG) system based on Large language model (LLM) has made significant progress. It can effectively reduce factuality hallucinations, but faithfulness hallucinations still exist. Previous methods for detecting faithfulness hallucinations either neglect to capture the models’ internal reasoning processes or handle those features coarsely, making it difficult for discriminators to learn. This paper proposes a semantic-level internal reasoning graph-based method for detecting faithfulness hallucination. Specifically, we first extend the layer-wise relevance propagation algorithm from the token level to the semantic level, constructing an internal reasoning graph based on attribution vectors. This provides a more faithful semantic-level representation of dependency. Furthermore, we design a general framework based on a small pre-trained language model to utilize the dependencies in LLM’s reasoning for training and hallucination detection, which can dynamically adjust the pass rate of correct samples through a threshold. Experimental results demonstrate that our method achieves better overall performance compared to state-of-the-art baselines on RAGTruth and Dolly-15k. Implementation available here: https://anonymous.4open.science/r/SIRG-1022.
Anthology ID:
2026.findings-acl.1385
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
27826–27841
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1385/
DOI:
Bibkey:
Cite (ACL):
Jianpeng Hu, Yanzeng Li, Jialun Zhong, Lei Zou, and Wenfa Qi. 2026. Detecting Hallucinations in Retrieval-Augmented Generation via Semantic-level Internal Reasoning Graph. In Findings of the Association for Computational Linguistics: ACL 2026, pages 27826–27841, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Detecting Hallucinations in Retrieval-Augmented Generation via Semantic-level Internal Reasoning Graph (Hu et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1385.pdf
Checklist:
 2026.findings-acl.1385.checklist.pdf