Mistake Notebook Learning: Batch-Clustered Failures for Training-Free Agent Adaptation

Xuanbo Su; Yingfang Zhang; Hao Luo; Xiaoteng Liu; Leo Huang

Abstract

With the growing adoption of Large Language Model (LLM) agents in persistent, real-world roles, they naturally encounter continuous streams of tasks and inevitable failures. A key limitation, however, is their inability to systematically learn from these mistakes, forcing them to repeat identical errors in similar contexts. Unlike prior training-free methods that primarily store raw instance-level experience or focus on retrieving successful trajectories, we propose Mistake Notebook Learning (MNL), a novel memory framework that enables agents to self-curate generalizable guidance from batch-clustered failures. This mechanism allows agents to distill shared error patterns into structured "mistake notes", updating an external memory only when batch performance improves to ensure stability. To further amplify adaptability, we integrate MNL with test-time scaling, leveraging aggregated failure patterns to actively steer the search process away from known pitfalls. Experiments on mathematical reasoning, Text-to-SQL, and interactive agent benchmarks show that MNL achieves competitive performance compared to existing memory mechanisms in both effectiveness and efficiency. These findings position structured mistake abstraction as a critical lever for robust agent evolution, enabling continuous improvement without the cost of parameter updates.
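The gated update described in the abstract (cluster the failures in a batch, distill one note per cluster, and commit the notes only if batch performance improves) can be illustrated with a minimal sketch. Everything named here is a hypothetical stand-in and not the authors' implementation: `toy_solver` plays the role of the LLM agent, `first_token` replaces whatever clustering MNL actually uses, and `toy_note` replaces LLM-written mistake notes.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Task = Tuple[str, str]                          # (question, expected answer)
Solver = Callable[[str, Sequence[str]], str]    # (question, notes) -> answer


def batch_accuracy(solver: Solver, batch: List[Task], notes: List[str]) -> float:
    """Fraction of the batch the solver answers correctly given the notes."""
    correct = sum(1 for question, answer in batch if solver(question, notes) == answer)
    return correct / len(batch)


def gated_update(
    solver: Solver,
    batch: List[Task],
    notes: List[str],
    write_note: Callable[[str, List[Task]], str],
    cluster_key: Callable[[str], str],
) -> List[str]:
    """One batch step: group failures, draft one note per group,
    and keep the drafts only if batch accuracy strictly improves."""
    baseline = batch_accuracy(solver, batch, notes)

    # Group failed tasks by a cluster key (assumption: a simple key function).
    clusters: Dict[str, List[Task]] = {}
    for question, answer in batch:
        if solver(question, notes) != answer:
            clusters.setdefault(cluster_key(question), []).append((question, answer))

    # Draft one candidate note per failure cluster.
    candidate = notes + [write_note(key, fails) for key, fails in clusters.items()]

    # Stability gate: commit only when the batch score goes up.
    if clusters and batch_accuracy(solver, batch, candidate) > baseline:
        return candidate
    return notes


def toy_solver(question: str, notes: Sequence[str]) -> str:
    """Hypothetical agent: mishandles 'div' tasks unless a note mentions 'div'."""
    op, x, y = question.split()
    a, b = int(x), int(y)
    if op == "add":
        return str(a + b)
    if any("div" in note for note in notes):
        return str(a // b)
    return str(a * b)  # the recurring mistake


def first_token(question: str) -> str:
    """Hypothetical clustering: tasks sharing an operator share a cluster."""
    return question.split()[0]


def toy_note(key: str, fails: List[Task]) -> str:
    """Hypothetical note writer summarizing a failure cluster."""
    return f"{key}: {len(fails)} similar failures; re-check the operator"


batch = [("add 1 2", "3"), ("div 8 2", "4"), ("div 9 3", "3")]
notes = gated_update(toy_solver, batch, [], toy_note, first_token)
print(notes)
```

In this toy run the two `div` failures fall into one cluster, the single drafted note raises batch accuracy, and the note is committed. A note that does not help the batch would be discarded and the memory left unchanged.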

Anthology ID: 2026.findings-acl.719
Volume: Findings of the Association for Computational Linguistics: ACL 2026
Month: July
Year: 2026
Address: San Diego, California, United States
Editors: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 14629–14645
URL: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.719/
Cite (ACL): Xuanbo Su, Yingfang Zhang, Hao Luo, Xiaoteng Liu, and Leo Huang. 2026. Mistake Notebook Learning: Batch-Clustered Failures for Training-Free Agent Adaptation. In Findings of the Association for Computational Linguistics: ACL 2026, pages 14629–14645, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal): Mistake Notebook Learning: Batch-Clustered Failures for Training-Free Agent Adaptation (Su et al., Findings 2026)
PDF: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.719.pdf
Checklist: 2026.findings-acl.719.checklist.pdf
