It Takes Two: A Dual Stage Approach for Terminology-Aware Translation

Akshat Jaswal


Abstract
This paper introduces DuTerm, a novel two-stage architecture for terminology-constrained machine translation. Our system combines a terminology-aware NMT model, adapted via fine-tuning on large-scale synthetic data, with a prompt-based LLM for post-editing. The LLM stage refines NMT output and enforces terminology adherence. We evaluate DuTerm on English-to German, English-to-Spanish, and English-to-Russian for the WMT 2025 Terminology Shared Task. We demonstrate that flexible, context-driven terminology handling by the LLM consistently yields higher quality translations than strict constraint enforcement. Our results highlight a critical trade-off, revealing that an LLM’s intrinsic knowledge often provides a stronger basis for high-quality translation than rigid, externally imposed constraints.
Anthology ID:
2025.wmt-1.112
Volume:
Proceedings of the Tenth Conference on Machine Translation
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Barry Haddow, Tom Kocmi, Philipp Koehn, Christof Monz
Venue:
WMT
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1344–1350
Language:
URL:
https://preview.aclanthology.org/ingest-emnlp/2025.wmt-1.112/
DOI:
Bibkey:
Cite (ACL):
Akshat Jaswal. 2025. It Takes Two: A Dual Stage Approach for Terminology-Aware Translation. In Proceedings of the Tenth Conference on Machine Translation, pages 1344–1350, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
It Takes Two: A Dual Stage Approach for Terminology-Aware Translation (Jaswal, WMT 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-emnlp/2025.wmt-1.112.pdf