Nathan Wolf
2026
Lost in Translation, and Found: Detecting and Interpreting Translation Effects
Shira Wein | Anna Serbina | Jiyuan Ji | Nathan Wolf | Jason DeGraaff | Prajakta Kini | Maria Leonor Pacheco
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Shira Wein | Anna Serbina | Jiyuan Ji | Nathan Wolf | Jason DeGraaff | Prajakta Kini | Maria Leonor Pacheco
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Translationese refers to the statistical patterns that distinguish translated texts from original texts, which are often subtle and imperceptible to human readers. When translated texts appear in either training or testing data, these patterns can negatively affect model performance or warp model evaluation. We approach the task of discerning whether a text was originally written in English or translated into English by fine-tuning contemporary foundation models at distinct item lengths and achieve state-of-the-art performance (94% Macro F1). Given that these linguistic cues are subtle and often imperceptible to humans, we analyze the features which enable our model’s high performance. Employing a suite of interpretability-based techniques, we find that: (1) our high accuracy is enabled by a collection of linguistic features, a number of which correspond with linguistic theories of translationese, and (2) pretrained neural models are adept at picking up these features without any fine-tuning.