Andre Melo
2025
Don’t Forget the Base Retriever! A Low-Resource Graph-based Retriever for Multi-hop Question Answering
Andre Melo
|
Enting Chen
|
Pavlos Vougiouklis
|
Chenxin Diao
|
Shriram Piramanayagam
|
Ruofei Lai
|
Jeff Z. Pan
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track
Traditional Retrieval-augmented Generation systems struggle with complex multi-hop questions, which often require reasoning over multiple passages. While GraphRAG approaches address these challenges, most of them rely on expensive LLM calls. In this paper, we propose GR\small{IEVER}, a lightweight, low-resource, multi-step graph-based retriever for multi-hop QA. Unlike prior work, GR\small{IEVER} does not rely on LLMs and can perform multi-step retrieval in a few hundred milliseconds. It efficiently indexes passages alongside an associated knowledge graph and employs a hybrid retriever combined with aggressive filtering to reduce retrieval latency. Experiments on multi-hop QA datasets demonstrate that GR\small{IEVER} outperforms conventional retrievers and shows strong potential as a base retriever within multi-step agentic frameworks.
GeAR: Graph-enhanced Agent for Retrieval-augmented Generation
Zhili Shen
|
Chenxin Diao
|
Pavlos Vougiouklis
|
Pascual Merita
|
Shriram Piramanayagam
|
Enting Chen
|
Damien Graux
|
Andre Melo
|
Ruofei Lai
|
Zeren Jiang
|
Zhongyang Li
|
Ye Qi
|
Yang Ren
|
Dandan Tu
|
Jeff Z. Pan
Findings of the Association for Computational Linguistics: ACL 2025
Retrieval-augmented Generation (RAG) relies on effective retrieval capabilities, yet traditional sparse and dense retrievers inherently struggle with multi-hop retrieval scenarios. In this paper, we introduce G\small{E}\normalsize{AR}, a system that advances RAG performance through two key innovations: (i) an efficient graph expansion mechanism that augments any conventional base retriever, such as BM25, and (ii) an agent framework that incorporates the resulting graph-based retrieval into a multi-step retrieval framework. Our evaluation demonstrates G\small{E}\normalsize{AR}‘s superior retrieval capabilities across three multi-hop question answering datasets. Notably, our system achieves state-of-the-art results with improvements exceeding 10% on the challenging MuSiQue dataset, while consuming fewer tokens and requiring fewer iterations than existing multi-step retrieval systems. The project page is available at https://gear-rag.github.io.
Search
Fix author
Co-authors
- Enting Chen 2
- Chenxin Diao 2
- Ruofei Lai 2
- Jeff Z. Pan 2
- Shriram Piramanayagam 2
- show all...