Abstract
Machine Translation (MT) has become an integral part of daily life for millions of people, with its output being so fluent that users often cannot distinguish it from human translation. However, these fluid texts often harbor algorithmic traces, from limited lexical choices to societal misrepresentations. This raises concerns about the possible effects of MT on natural language and human communication and calls for regular evaluations of machine-generated translations for different languages. Our paper explores the output of three widely used engines (Google, DeepL, Microsoft Azure) and one smaller commercial system. We translate the English and French source texts of seven diverse parallel corpora into German and compare MT-produced texts to human references in terms of lexical, syntactic, and morphological features. Additionally, we investigate how MT leverages lexical borrowings and analyse the distribution of anglicisms across the German translations.

- Anthology ID:
- 2023.nodalida-1.22
- Volume:
- Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)
- Month:
- May
- Year:
- 2023
- Address:
- Tórshavn, Faroe Islands
- Editors:
- Tanel Alumäe, Mark Fishel
- Venue:
- NoDaLiDa
- Publisher:
- University of Tartu Library
- Pages:
- 215–227
- URL:
- https://preview.aclanthology.org/build-pipeline-with-new-library/2023.nodalida-1.22/
- Cite (ACL):
- Anastassia Shaitarova, Anne Göhring, and Martin Volk. 2023. Machine vs. Human: Exploring Syntax and Lexicon in German Translations, with a Spotlight on Anglicisms. In Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa), pages 215–227, Tórshavn, Faroe Islands. University of Tartu Library.
- Cite (Informal):
- Machine vs. Human: Exploring Syntax and Lexicon in German Translations, with a Spotlight on Anglicisms (Shaitarova et al., NoDaLiDa 2023)
- PDF:
- https://preview.aclanthology.org/build-pipeline-with-new-library/2023.nodalida-1.22.pdf