Just Use XML: Revisiting Joint Translation and Label Projection

Thennal D K, Chris Biemann, Hans Ole Hatzel


Abstract
Label projection is an effective technique for cross-lingual transfer, extending span-annotated datasets from a high-resource language to low-resource ones. Most approaches perform label projection as a separate step after machine translation, and prior work that combines the two reports degraded translation quality. We re-evaluate this claim with LabelPigeon, a novel framework that jointly performs translation and label projection via XML tags. We design a direct evaluation scheme for label projection, and find that LabelPigeon outperforms baselines and actively improves translation quality in 11 languages. We further assess translation quality across 203 languages and varying annotation complexity, finding consistent improvement attributed to additional fine-tuning. Finally, across 27 languages and three downstream tasks, we report substantial gains in cross-lingual transfer over comparable work, up to +40.2 F1 on NER. Overall, our results demonstrate that XML-tagged label projection provides effective and efficient label transfer without compromising translation quality.
Anthology ID:
2026.findings-acl.1721
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
34461–34478
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1721/
DOI:
Bibkey:
Cite (ACL):
Thennal D K, Chris Biemann, and Hans Ole Hatzel. 2026. Just Use XML: Revisiting Joint Translation and Label Projection. In Findings of the Association for Computational Linguistics: ACL 2026, pages 34461–34478, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Just Use XML: Revisiting Joint Translation and Label Projection (K et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1721.pdf
Checklist:
 2026.findings-acl.1721.checklist.pdf