SmartAD: Capacity-Aligned Agent Distillation for Small Language Models

Guokai Tang, Feng Zhao


Abstract
Large language models (LLMs) show strong reasoning and decision-making ability, but their high inference cost motivates transferring agentic skills to small language models (SLMs). Agent distillation trains SLMs on full reason–act–observe trajectories from a tool-using teacher so that they acquire the teacher’s tool-use capabilities. However, teacher trajectories vary widely in how compatible they are with the student, and some are simply hard for it to learn; moreover, a uniform token-level loss keeps SLMs from focusing on the tool-use patterns and final decisions that drive successful reasoning. We therefore propose SmartAD, a capacity-aligned agent distillation framework that improves both the distilled data and the supervision signal. SmartAD (i) selects, for each training example, the trajectory with the minimum negative log-likelihood among multiple correct teacher samples to obtain student-friendly training data, and (ii) applies a segment-weighted loss that emphasizes action execution and final decision spans over intermediate reasoning. Experiments on multi-hop QA and math benchmarks with 1.5B and 3B models show that SmartAD consistently outperforms all baselines. Overall, trajectory selection and segment-weighted supervision let small models learn the teacher’s capabilities more easily and efficiently, achieving capacity-aligned distillation.
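The two components named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not say which model scores the trajectories, whether the negative log-likelihood is length-normalized, or what the segment weights are; the sketch assumes student-model scoring, a per-token mean, and placeholder weights of 2.0 for action and final-decision tokens. The names `Trajectory`, `mean_nll`, `select_trajectory`, `segment_weighted_loss`, and `SEGMENT_WEIGHTS` are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Sequence


@dataclass
class Trajectory:
    """One teacher sample, already scored token by token (hypothetical container)."""
    token_logprobs: List[float]  # assumed: student log-probability of each target token
    segments: List[str]          # per-token label: "reason", "action", or "final"
    is_correct: bool             # whether the teacher reached the right final answer


def mean_nll(traj: Trajectory) -> float:
    """Length-normalized negative log-likelihood (normalization is an assumption)."""
    return -sum(traj.token_logprobs) / len(traj.token_logprobs)


def select_trajectory(candidates: Sequence[Trajectory]) -> Optional[Trajectory]:
    """Among correct teacher samples, keep the one with the lowest NLL."""
    correct = [t for t in candidates if t.is_correct]
    if not correct:
        return None  # no usable sample for this training example
    return min(correct, key=mean_nll)


# Placeholder weights: the abstract only says action and final-decision spans
# are emphasized over intermediate reasoning, not by how much.
SEGMENT_WEIGHTS: Dict[str, float] = {"reason": 1.0, "action": 2.0, "final": 2.0}


def segment_weighted_loss(
    traj: Trajectory, weights: Dict[str, float] = SEGMENT_WEIGHTS
) -> float:
    """Weighted average of per-token NLL, with weights chosen by segment label."""
    weighted = sum(
        weights[seg] * -lp for seg, lp in zip(traj.segments, traj.token_logprobs)
    )
    norm = sum(weights[seg] for seg in traj.segments)
    return weighted / norm
```

With equal weights for every segment, `segment_weighted_loss` reduces to the uniform token-level loss that the abstract contrasts against, so the weights are the only thing that changes the supervision signal in this sketch.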
Anthology ID:
2026.findings-acl.1349
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
27045–27057
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1349/
Cite (ACL):
Guokai Tang and Feng Zhao. 2026. SmartAD: Capacity-Aligned Agent Distillation for Small Language Models. In Findings of the Association for Computational Linguistics: ACL 2026, pages 27045–27057, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
SmartAD: Capacity-Aligned Agent Distillation for Small Language Models (Tang & Zhao, Findings 2026)
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1349.pdf
Checklist:
 2026.findings-acl.1349.checklist.pdf