Reinforcing Compositional Retrieval: Retrieving Step-by-Step for Composing Informative Contexts

Quanyu Long, Jianda Chen, Zhengyuan Liu, Nancy F. Chen, Wenya Wang, Sinno Jialin Pan


Abstract
Large Language Models (LLMs) have demonstrated remarkable capabilities across numerous tasks, yet they often rely on external context to handle complex tasks. While retrieval-augmented frameworks traditionally focus on selecting top-ranked documents in a single pass, many real-world scenarios demand compositional retrieval, where multiple sources must be combined in a coordinated manner. In this work, we propose a tri-encoder sequential retriever that models this process as a Markov Decision Process (MDP), decomposing the probability of retrieving a set of elements into a sequence of conditional probabilities and allowing each retrieval step to be conditioned on previously selected examples. We train the retriever in two stages: first, we efficiently construct supervised sequential data for initial policy training; we then refine the policy to align with the LLM’s preferences using a reward grounded in the structural correspondence of generated programs. Experimental results show that our method consistently and significantly outperforms baselines, underscoring the importance of explicitly modeling inter-example dependencies. These findings highlight the potential of compositional retrieval for tasks requiring multiple pieces of evidence or examples.
Anthology ID:
2025.findings-acl.396
Volume:
Findings of the Association for Computational Linguistics: ACL 2025
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
7633–7651
Language:
URL:
https://preview.aclanthology.org/display_plenaries/2025.findings-acl.396/
DOI:
Bibkey:
Cite (ACL):
Quanyu Long, Jianda Chen, Zhengyuan Liu, Nancy F. Chen, Wenya Wang, and Sinno Jialin Pan. 2025. Reinforcing Compositional Retrieval: Retrieving Step-by-Step for Composing Informative Contexts. In Findings of the Association for Computational Linguistics: ACL 2025, pages 7633–7651, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Reinforcing Compositional Retrieval: Retrieving Step-by-Step for Composing Informative Contexts (Long et al., Findings 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/display_plenaries/2025.findings-acl.396.pdf