Tag-Evol: Achieving Efficient Instruction Evolving via Tag Injection

Yixuan Wang, Shiqi Zhou, Chuanzhe Guo, Qingfu Zhu


Abstract
Evol-Instruct has made significant improvements as a data synthesis method in several areas. Existing methods typically rely on a fixed set of strategies to evolve, which require manual design and are monolithic in form. In addition, iterative evolution also makes the acquisition of hard samples expensive. In view of this, we propose the Tag-Evol framework, a more diverse and efficient instruction evolving method. Specifically, Tag-Evol uses diverse and specific knowledge tags as strategies to achieve controlled evolution by injecting different combinations of tags into the original instructions. Experiments with multiple backbones in mathematical and code domain benchmarks show that the proposed method generates significantly better evolved data than other methods. Furthermore, we conduct a thorough analysis of the evolved data, demonstrating that Tag-Evol is not only efficient but also generates more diverse and challenging data.
Anthology ID:
2025.findings-acl.409
Volume:
Findings of the Association for Computational Linguistics: ACL 2025
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
7856–7869
Language:
URL:
https://preview.aclanthology.org/display_plenaries/2025.findings-acl.409/
DOI:
Bibkey:
Cite (ACL):
Yixuan Wang, Shiqi Zhou, Chuanzhe Guo, and Qingfu Zhu. 2025. Tag-Evol: Achieving Efficient Instruction Evolving via Tag Injection. In Findings of the Association for Computational Linguistics: ACL 2025, pages 7856–7869, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Tag-Evol: Achieving Efficient Instruction Evolving via Tag Injection (Wang et al., Findings 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/display_plenaries/2025.findings-acl.409.pdf