VectorEdits: A Dataset and Benchmark for Instruction-Based Editing of Vector Graphics

Josef Kuchar; Marek Kadlčík; Michal Spiegel; Michal Štefánik

VectorEdits: A Dataset and Benchmark for Instruction-Based Editing of Vector Graphics

Josef Kuchar, Marek Kadlcik, Michal Spiegel, Michal Stefanik

Abstract

We introduce a large-scale dataset for instruction-guided vector image editing, consisting of over 270,000 pairs of SVG images paired with natural language edit instructions. Our dataset enables training and evaluation of models that modify vector graphics based on textual commands. We describe the data collection process, including image pairing via CLIP similarity and instruction generation with vision-language models. Initial experiments with state-of-the-art large language models reveal that current methods struggle to produce accurate and valid edits, underscoring the challenge of this task. To foster research in natural language-driven vector graphic generation and editing, we make our resources created within this work publicly available.

Anthology ID:: 2026.lrec-main.868
Volume:: Proceedings of the Fifteenth Language Resources and Evaluation Conference
Month:: May
Year:: 2026
Address:: Palma de Mallorca, Spain
Editors:: Stelios Piperidis, Núria Bel, Henk van den Heuvel, Nancy Ide, Simon Krek, Antonio Toral
Venue:: LREC
SIG:
Publisher:: ELRA Language Resource Association
Note:
Pages:: 11119–11124
Language:
URL:: https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.868/
DOI:
Bibkey:
Cite (ACL):: Josef Kuchar, Marek Kadlcik, Michal Spiegel, and Michal Stefanik. 2026. VectorEdits: A Dataset and Benchmark for Instruction-Based Editing of Vector Graphics. International Conference on Language Resources and Evaluation, main:11119–11124.
Cite (Informal):: VectorEdits: A Dataset and Benchmark for Instruction-Based Editing of Vector Graphics (Kuchar et al., LREC 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.868.pdf

PDF Cite Search Fix data