POLARIS: A Gödel Agent Framework for Small Language Models through Experience-Abstracted Policy Repair

Aditya Namdev Kakade, Vivek Srivastava, Shirish Karande


Abstract
Gödel agent realize recursive self-improvement: an agent inspects its own policy and traces and then modifies that policy in a tested loop. We introduce Polaris, a Gödel agent for compact models that performs policy repair via experience abstraction, turning failures into policy updates through a structured cycle of analysis, strategy formation, abstraction, and minimal code patch repair with conservative checks. Unlike response level self-correction or parameter tuning, Polaris makes policy level changes with small, auditable patches that persist in the policy and are reused on unseen instances within each benchmark. As part of the loop, the agent engages in meta reasoning: it explains its errors, proposes concrete revisions to its own policy, and then updates the policy. To enable cumulative policy refinement, we introduce experience abstraction, which distills failures into compact, reusable strategies that transfer to unseen instances. On MGSM, DROP, GPQA, and LitBench (covering arithmetic reasoning, compositional inference, graduate-level problem solving, and creative writing evaluation), a 7-billion-parameter model equipped with Polaris achieves consistent gains over the base policy and competitive baselines.
Anthology ID:
2026.findings-acl.1969
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
39505–39537
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.findings-acl.1969/
DOI:
Bibkey:
Cite (ACL):
Aditya Namdev Kakade, Vivek Srivastava, and Shirish Karande. 2026. POLARIS: A Gödel Agent Framework for Small Language Models through Experience-Abstracted Policy Repair. In Findings of the Association for Computational Linguistics: ACL 2026, pages 39505–39537, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
POLARIS: A Gödel Agent Framework for Small Language Models through Experience-Abstracted Policy Repair (Kakade et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.findings-acl.1969.pdf
Checklist:
 2026.findings-acl.1969.checklist.pdf