Characterizing the Effects of Translation on Intertextuality using Multilingual Embedding Spaces

Hope McGovern, Hale Sirin, Tom Lippincott


Abstract
Rhetorical devices are difficult to translate, but they are crucial to the translation of literary documents. We investigate the use of multilingual embedding spaces to characterize the preservation of intertextuality, one common rhetorical device, across human and machine translation. To do so, we use Biblical texts, which are both full of intertextual references and are highly translated works. We provide a metric to characterize intertextuality at the corpus level and provide a quantitative analysis of the preservation of this rhetorical device across extant human translations and machine-generated counterparts. We go on to provide qualitative analysis of cases wherein human translations over- or underemphasize the intertextuality present in the text, whereas machine translations provide a neutral baseline. This provides support for established scholarship proposing that human translators have a propensity to amplify certain literary characteristics of the original manuscripts.
Anthology ID:
2025.naacl-short.14
Volume:
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers)
Month:
April
Year:
2025
Address:
Albuquerque, New Mexico
Editors:
Luis Chiruzzo, Alan Ritter, Lu Wang
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
161–167
Language:
URL:
https://preview.aclanthology.org/fix-sig-urls/2025.naacl-short.14/
DOI:
Bibkey:
Cite (ACL):
Hope McGovern, Hale Sirin, and Tom Lippincott. 2025. Characterizing the Effects of Translation on Intertextuality using Multilingual Embedding Spaces. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers), pages 161–167, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):
Characterizing the Effects of Translation on Intertextuality using Multilingual Embedding Spaces (McGovern et al., NAACL 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/fix-sig-urls/2025.naacl-short.14.pdf