RAR2: Retrieval-Augmented Medical Reasoning via Thought-Driven Retrieval

Kaishuai Xu, Wenjun Hou, Yi Cheng, Wenjie Li


Abstract
Large Language Models (LLMs) have shown promising performance on diverse medical benchmarks, highlighting their potential in supporting real-world clinical tasks. Retrieval-Augmented Generation (RAG) has emerged as a key approach for mitigating knowledge gaps and hallucinations by incorporating external medical information. However, RAG still struggles with complex medical questions that require intensive reasoning, as surface-level input often fails to reflect the true knowledge needs of the task. Existing methods typically focus on refining queries without explicitly modeling the reasoning process, limiting their ability to retrieve and integrate clinically relevant knowledge. In this work, we propose RAR2, a joint learning framework that improves both Reasoning-Augmented Retrieval and Retrieval-Augmented Reasoning. RAR2 constructs a thought process to uncover implicit knowledge requirements and uses it to guide retrieval and answer generation. We build a training dataset of mixed preference pairs and apply Direct Preference Optimization (DPO) to train the model. Moreover, we design two test-time scaling strategies to explore the boundaries of our framework. Experiments demonstrate the effectiveness of RAR2 across several biomedical question answering datasets, outperforming RAG baselines with or without fine-tuning.
Anthology ID:
2025.findings-emnlp.1110
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2025
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
20386–20396
Language:
URL:
https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.1110/
DOI:
10.18653/v1/2025.findings-emnlp.1110
Bibkey:
Cite (ACL):
Kaishuai Xu, Wenjun Hou, Yi Cheng, and Wenjie Li. 2025. RAR2: Retrieval-Augmented Medical Reasoning via Thought-Driven Retrieval. In Findings of the Association for Computational Linguistics: EMNLP 2025, pages 20386–20396, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
RAR2: Retrieval-Augmented Medical Reasoning via Thought-Driven Retrieval (Xu et al., Findings 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.1110.pdf
Checklist:
 2025.findings-emnlp.1110.checklist.pdf