Efficient Technical Term Translation: A Knowledge Distillation Approach for Parenthetical Terminology Translation
Jiyoon Myung, Jihyeon Park, Jungki Son, Kyungro Lee, Joohyung Han
Abstract
This paper addresses the challenge of accurately translating technical terms, which are crucial for clear communication in specialized fields. We introduce the Parenthetical Terminology Translation (PTT) task, designed to mitigate potential inaccuracies by displaying the original term in parentheses alongside its translation. To implement this approach, we generated a representative PTT dataset using a collaborative approach with large language models and applied knowledge distillation to fine-tune traditional Neural Machine Translation (NMT) models and small-sized Large Language Models (sLMs). Additionally, we developed a novel evaluation metric to assess both overall translation accuracy and the correct parenthetical presentation of terms. Our findings indicate that sLMs did not consistently outperform NMT models, with fine-tuning proving more effective than few-shot prompting, particularly in models with continued pre-training in the target language. These insights contribute to the advancement of more reliable terminology translation methodologies.
- Anthology ID:
- 2024.wmt-1.129
- Volume:
- Proceedings of the Ninth Conference on Machine Translation
- Month:
- November
- Year:
- 2024
- Address:
- Miami, Florida, USA
- Editors:
- Barry Haddow, Tom Kocmi, Philipp Koehn, Christof Monz
- Venues:
- WMT | WS
- Publisher:
- Association for Computational Linguistics
- Pages:
- 1410–1427
- URL:
- https://preview.aclanthology.org/fix-sig-urls/2024.wmt-1.129/
- DOI:
- 10.18653/v1/2024.wmt-1.129
- Cite (ACL):
- Jiyoon Myung, Jihyeon Park, Jungki Son, Kyungro Lee, and Joohyung Han. 2024. Efficient Technical Term Translation: A Knowledge Distillation Approach for Parenthetical Terminology Translation. In Proceedings of the Ninth Conference on Machine Translation, pages 1410–1427, Miami, Florida, USA. Association for Computational Linguistics.
- Cite (Informal):
- Efficient Technical Term Translation: A Knowledge Distillation Approach for Parenthetical Terminology Translation (Myung et al., WMT 2024)
- PDF:
- https://preview.aclanthology.org/fix-sig-urls/2024.wmt-1.129.pdf
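The abstract describes PTT output as a translation that keeps the original term in parentheses next to its translated form. As a rough illustration only, the sketch below checks how many expected source terms appear inside parentheses in a translated sentence. It is not the evaluation metric proposed in the paper; the function name `parenthetical_term_rate`, the substring matching rule, and the Korean example sentence are all assumptions made for this example.

```python
import re


def parenthetical_term_rate(translation: str, source_terms: list[str]) -> float:
    """Return the fraction of source terms found inside parentheses.

    Hypothetical helper: a case-insensitive substring match against the
    text of each parenthesized span, not the paper's metric.
    """
    if not source_terms:
        return 0.0
    # Collect the contents of every non-nested "(...)" span, lowercased.
    spans = [s.lower() for s in re.findall(r"\(([^()]*)\)", translation)]
    # A term counts as a hit if any parenthesized span contains it.
    hits = sum(
        any(term.lower() in span for span in spans) for term in source_terms
    )
    return hits / len(source_terms)


# Illustrative sentence (assumed target language: Korean).
example = "지식 증류(knowledge distillation)는 작은 모델을 학습시키는 방법이다."
rate = parenthetical_term_rate(example, ["knowledge distillation", "fine-tuning"])
print(rate)
```

A check like this only covers the parenthetical presentation of terms; overall translation accuracy, which the abstract says the metric also assesses, would need a separate measure.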