ToolGrad: Efficient Tool-use Dataset Generation with Textual “Gradients”
Zhongyi Zhou, Kohei Uehara, Haoyu Zhang, Jingtao Zhou, Lin Gu, Ruofei Du, Zheng Xu, Tatsuya Harada
Abstract
Prior work synthesizes tool-use LLM datasets by first generating a user query, followed by complex tool-use annotations like DFS. This inherently leads to inevitable annotation failures and low efficiency in data generation. We introduce ToolGrad, an agentic framework that inverts this paradigm. ToolGrad first constructs valid tool-use chains through an iterative process guided by textual "gradients", and then synthesizes corresponding user queries. This "answer-first" approach led to ToolGrad-500, a dataset generated with more complex tool use, lower cost, and almost 100% pass rate. Experiments show that models trained on ToolGrad-500 outperform those trained on expensive baseline datasets and proprietary LLMs.- Anthology ID:
- 2026.findings-acl.950
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2026
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 19040–19056
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.950/
- DOI:
- Cite (ACL):
- Zhongyi Zhou, Kohei Uehara, Haoyu Zhang, Jingtao Zhou, Lin Gu, Ruofei Du, Zheng Xu, and Tatsuya Harada. 2026. ToolGrad: Efficient Tool-use Dataset Generation with Textual “Gradients”. In Findings of the Association for Computational Linguistics: ACL 2026, pages 19040–19056, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- ToolGrad: Efficient Tool-use Dataset Generation with Textual “Gradients” (Zhou et al., Findings 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.950.pdf