Eval-RAR: Evaluation-Driven Retrieval-Augmented Reasoning via Reinforcement Learning

Heng Yu, Rui Li, Qi Liu, Wenjun Feng, Junfeng Kang, Yi Zhan


Abstract
Retrieval-augmented generation (RAG) effectively extends the knowledge boundaries of large language models (LLMs) for complex tasks, yet current paradigms typically optimize for an interleaving of reasoning and retrieval, where models fail to critically evaluate retrieved information against the target question. Most existing methods rely on sparse outcome-based rewards, failing to provide explicit supervision for the internal reasoning process or to diagnose information inadequacy. To address this, we propose Eval-RAR, an Evaluation-driven Retrieval-Augmented Reasoning framework. Eval-RAR introduces a "Search-then-Evaluate" paradigm where the model performs explicit self-evaluation after each search step, generating a rationale to either identify sufficient evidence or specify missing information to guide subsequent queries. To optimize this process, we employ reinforcement learning with a fine-grained evaluation reward, providing intermediate feedback that encourages the model to track core entities and maintain logical consistency. Experiments on seven single-hop and multi-hop QA benchmarks demonstrate that Eval-RAR outperforms existing methods.
Anthology ID:
2026.findings-acl.189
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
3900–3912
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.189/
DOI:
Bibkey:
Cite (ACL):
Heng Yu, Rui Li, Qi Liu, Wenjun Feng, Junfeng Kang, and Yi Zhan. 2026. Eval-RAR: Evaluation-Driven Retrieval-Augmented Reasoning via Reinforcement Learning. In Findings of the Association for Computational Linguistics: ACL 2026, pages 3900–3912, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Eval-RAR: Evaluation-Driven Retrieval-Augmented Reasoning via Reinforcement Learning (Yu et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.189.pdf
Checklist:
 2026.findings-acl.189.checklist.pdf