Adversarial Yet Cooperative: Multi-Perspective Reasoning in Retrieved-Augmented Language Models

Can Xu, Lingyong Yan, Jiayi Wu, Haosen Wang, Shuaiqiang Wang, Yuchen Li, Jizhou Huang, Dawei Yin, Xiang Li


Abstract
Recent advances in synergizing large reasoning models (LRMs) with retrieval-augmented generation (RAG) have shown promising results, yet two critical challenges remain: (1) reasoning models typically operate from a single, unchallenged perspective, limiting their ability to conduct deep, self-correcting reasoning over external documents, and (2) existing training paradigms rely excessively on outcome-oriented rewards, which provide insufficient signal for shaping the complex, multi-step reasoning process. To address these issues, we propose an Reasoner-Verifier framework named Adversarial Reasoning RAG (ARR). The Reasoner and Verifier engage in reasoning on retrieved evidence and critiquing each other’s logic while being guided by process-aware advantage that requires no external scoring model. This reward combines explicit observational signals with internal model uncertainty to jointly optimize reasoning fidelity and verification rigor. Experiments on multiple benchmarks demonstrate the effectiveness of our method. Our code is available at [link](https://github.com/lakhfskn/anonymous-code-of-arr).
Anthology ID:
2026.findings-acl.1868
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
37469–37480
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1868/
DOI:
Bibkey:
Cite (ACL):
Can Xu, Lingyong Yan, Jiayi Wu, Haosen Wang, Shuaiqiang Wang, Yuchen Li, Jizhou Huang, Dawei Yin, and Xiang Li. 2026. Adversarial Yet Cooperative: Multi-Perspective Reasoning in Retrieved-Augmented Language Models. In Findings of the Association for Computational Linguistics: ACL 2026, pages 37469–37480, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Adversarial Yet Cooperative: Multi-Perspective Reasoning in Retrieved-Augmented Language Models (Xu et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1868.pdf
Checklist:
 2026.findings-acl.1868.checklist.pdf