ToolDNA: Autonomous Evolution of Tool Metadata for Robust Dialogue Agents

Qiuyuan Ai; Cong Wang; Jiaqi Zhang; Zengxin Han; Jie Song

ToolDNA: Autonomous Evolution of Tool Metadata for Robust Dialogue Agents

Qiuyuan Ai, Cong Wang, Jiaqi Zhang, Zengxin Han, Jie Song

Abstract

Task-oriented dialogue (TOD) systems are vital for facilitating complex, goal-directed interactions across sectors like customer support and online retail. However, they face persistent limitations: labor-intensive manual metadata tuning and sparse reinforcement learning (RL) rewards that fail to diagnose invocation errors. To address this, we propose ToolDNA, a dynamic adaptation framework enabling autonomous co-evolution of policy networks and tool metadata via RL, anchored by two synergistic loops. An RL loop optimizes policies by generating rollout trajectories (reasoning, actions, descriptive updates) from user inputs, with multi-dimensional rewards refining invocations. A tool metadata loop—coordinated by a dedicated Tool Manager—evolves metadata through policy-generated candidates during rollouts and Feedback LLM-derived refinements from historical data. These mutually reinforcing loops close traditional reward gaps, forming a closed-loop trial-error-reflection cycle for self-improvement. Extensive experiments on a real-world dataset of 3,100 customer service dialogues confirm ToolDNA’s superiority, with notable gains over baselines: it achieves +11% problem resolution and +54% accuracy over commercial LLMs with prompt engineering; +25%/+35% over supervised fine-tuning; and +15%/+15% over traditional RL baseline. Linguistic analysis corroborates evolved metadata retain semantic intent while enhancing parseability. Case studies in two typical contexts, i.e., car inventory search and loan calculation, further validates its ability to resolve critical ambiguities. ToolDNA pioneers scalable self-improvement for robust, deployable tool-augmented agents with minimal human oversight. We release our code to facilitate future research.

Anthology ID:: 2026.findings-acl.931
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 18660–18678
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.931/
DOI:
Bibkey:
Cite (ACL):: Qiuyuan Ai, Cong Wang, Jiaqi Zhang, Zengxin Han, and Jie Song. 2026. ToolDNA: Autonomous Evolution of Tool Metadata for Robust Dialogue Agents. In Findings of the Association for Computational Linguistics: ACL 2026, pages 18660–18678, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: ToolDNA: Autonomous Evolution of Tool Metadata for Robust Dialogue Agents (Ai et al., Findings 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.931.pdf
Checklist:: 2026.findings-acl.931.checklist.pdf

PDF Cite Search Checklist Fix data