A Study on Leveraging Search and Self-Feedback for Agent Reasoning

Karthikeyan K; Michelle Yuan; Elman Mansimov; Katerina Margatina; Anurag Pratik; Daniele Bonadiman; Monica Sunkara; Yi Zhang; Yassine Benajiba

A Study on Leveraging Search and Self-Feedback for Agent Reasoning

Karthikeyan K, Michelle Yuan, Elman Mansimov, Katerina Margatina, Anurag Pratik, Daniele Bonadiman, Monica Sunkara, Yi Zhang, Yassine Benajiba

Abstract

Recent works have demonstrated that incorporating search during inference can significantly improve reasoning capabilities of language agents. Some approaches may make use of the ground truth or rely on model’s own generated feedback. The search algorithm uses this feedback to then produce values that will update its criterion for exploring and exploiting various reasoning paths. In this study, we investigate how search and model’s self-feedback can be leveraged for reasoning tasks. First, we explore differences in ground-truth feedback and self-feedback during search for math reasoning. Second, we observe limitations in applying search techniques to more complex tasks like tool-calling and design domain-specific approaches to address these gaps. Our experiments reveal challenges related to generalization when solely relying on self-feedback during search. For search to work effectively, either access to the ground-truth is needed or feedback mechanisms need to be carefully designed for the specific task.

Anthology ID:: 2025.realm-1.18
Volume:: Proceedings of the 1st Workshop for Research on Agent Language Models (REALM 2025)
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Ehsan Kamalloo, Nicolas Gontier, Xing Han Lu, Nouha Dziri, Shikhar Murty, Alexandre Lacoste
Venues:: REALM | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 259–271
Language:
URL:: https://preview.aclanthology.org/landing_page/2025.realm-1.18/
DOI:
Bibkey:
Cite (ACL):: Karthikeyan K, Michelle Yuan, Elman Mansimov, Katerina Margatina, Anurag Pratik, Daniele Bonadiman, Monica Sunkara, Yi Zhang, and Yassine Benajiba. 2025. A Study on Leveraging Search and Self-Feedback for Agent Reasoning. In Proceedings of the 1st Workshop for Research on Agent Language Models (REALM 2025), pages 259–271, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: A Study on Leveraging Search and Self-Feedback for Agent Reasoning (K et al., REALM 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/landing_page/2025.realm-1.18.pdf

PDF Cite Search Fix data