Context-attended Adversarial Reinforcement Learning for Robust Multi-step Retrieval Augmented Generation

Yingtao Ren, Xiao Luo, Yu-Cheng Chang, Chin-teng Lin


Abstract
Multi-step retrieval-augmented generation has attracted increasing attention due to its capacity to improve the factuality of large language models with iterative retrieved knowledge. However, the performance of multi-step RAG systems is susceptible to potential retrieval noise and fabricated documents in real-world scenarios. Current approaches usually utilize supervised fine-tuning on predetermined noisy contexts to enhance the robustness. However, their performance remains inadequate when it comes to more complicated long-context scenarios due to the lack of adaptability. Towards this end, we propose a novel framework named Context-attended Adversarial Reinforcement Learning (CARE) for multi-step RAG systems against attacks. The core of our CARE is to conduct reinforcement learning on adversarial samples which are alternatingly enhanced with text gradients. In particular, our CARE includes a reward model to identify the accuracy of responses, which is minimized for the generation of adversarial samples with text gradients. These context-attended noisy samples are then utilized for reinforcement learning to maximize the rewards. The whole framework is conducted alternatingly from easy to hard samples to ensure the smoothness of the optimization. Extensive experiments on multi-step RAG benchmark datasets are conducted to validate the superiority of our proposed CARE in multiple noisy scenarios. Our code is available at https://github.com/yingtaoren/CARE.
Anthology ID:
2026.findings-acl.856
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
17305–17319
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.856/
DOI:
Bibkey:
Cite (ACL):
Yingtao Ren, Xiao Luo, Yu-Cheng Chang, and Chin-teng Lin. 2026. Context-attended Adversarial Reinforcement Learning for Robust Multi-step Retrieval Augmented Generation. In Findings of the Association for Computational Linguistics: ACL 2026, pages 17305–17319, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Context-attended Adversarial Reinforcement Learning for Robust Multi-step Retrieval Augmented Generation (Ren et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.856.pdf
Checklist:
 2026.findings-acl.856.checklist.pdf