Optimizing Multi-Hop Document Retrieval Through Intermediate Representations

Linjiaen Linjiaen, Jingyu Liu, Yingbo Liu


Abstract
Retrieval-augmented generation (RAG) encounters challenges when addressing complex queries, particularly multi-hop questions. While several methods tackle multi-hop queries by iteratively generating internal queries and retrieving external documents, these approaches are computationally expensive. In this paper, we identify a three-stage information processing pattern in LLMs during layer-by-layer reasoning, consisting of extraction, processing, and subsequent extraction steps. This observation suggests that the representations in intermediate layers contain richer information compared to those in other layers. Building on this insight, we propose Layer-wise RAG (L-RAG). Unlike prior methods that focus on generating new internal queries, L-RAG leverages intermediate representations from the middle layers, which capture next-hop information, to retrieve external knowledge. L-RAG achieves performance comparable to multi-step approaches while maintaining inference overhead similar to that of standard RAG. Experimental results show that L-RAG outperforms existing RAG methods on open-domain multi-hop question-answering datasets, including MuSiQue, HotpotQA, and 2WikiMultiHopQA. The code is available in https://anonymous.4open.science/r/L-RAG-ADD5/.
Anthology ID:
2025.findings-acl.816
Volume:
Findings of the Association for Computational Linguistics: ACL 2025
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
15798–15809
Language:
URL:
https://preview.aclanthology.org/display_plenaries/2025.findings-acl.816/
DOI:
Bibkey:
Cite (ACL):
Linjiaen Linjiaen, Jingyu Liu, and Yingbo Liu. 2025. Optimizing Multi-Hop Document Retrieval Through Intermediate Representations. In Findings of the Association for Computational Linguistics: ACL 2025, pages 15798–15809, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Optimizing Multi-Hop Document Retrieval Through Intermediate Representations (Linjiaen et al., Findings 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/display_plenaries/2025.findings-acl.816.pdf