Investigating the Linguistic Performance of Large Language Models in Machine Translation
Shushen Manakhimova, Vivien Macketanz, Eleftherios Avramidis, Ekaterina Lapshinova-Koltunski, Sergei Bagdasarov, Sebastian Möller
Abstract
This paper summarizes the results of our test suite evaluation on 39 machine translation systems submitted at the Shared Task of the Ninth Conference of Machine Translation (WMT24). It offers a fine-grained linguistic evaluation of machine translation outputs for English–German and English–Russian, resulting from significant manual linguistic effort. Based on our results, LLMs are inferior to NMT in English–German, both in overall scores and when translating specific linguistic phenomena, such as punctuation, complex future verb tenses, and stripping. LLMs show quite a competitive performance in English-Russian, although top-performing systems might struggle with some cases of named entities and terminology, function words, mediopassive voice, and semantic roles. Additionally, some LLMs generate very verbose or empty outputs, posing challenges to the evaluation process.- Anthology ID:
- 2024.wmt-1.28
- Volume:
- Proceedings of the Ninth Conference on Machine Translation
- Month:
- November
- Year:
- 2024
- Address:
- Miami, Florida, USA
- Editors:
- Barry Haddow, Tom Kocmi, Philipp Koehn, Christof Monz
- Venues:
- WMT | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 355–371
- Language:
- URL:
- https://preview.aclanthology.org/jlcl-multiple-ingestion/2024.wmt-1.28/
- DOI:
- 10.18653/v1/2024.wmt-1.28
- Cite (ACL):
- Shushen Manakhimova, Vivien Macketanz, Eleftherios Avramidis, Ekaterina Lapshinova-Koltunski, Sergei Bagdasarov, and Sebastian Möller. 2024. Investigating the Linguistic Performance of Large Language Models in Machine Translation. In Proceedings of the Ninth Conference on Machine Translation, pages 355–371, Miami, Florida, USA. Association for Computational Linguistics.
- Cite (Informal):
- Investigating the Linguistic Performance of Large Language Models in Machine Translation (Manakhimova et al., WMT 2024)
- PDF:
- https://preview.aclanthology.org/jlcl-multiple-ingestion/2024.wmt-1.28.pdf