Contextual Refinement of Translations: Large Language Models for Sentence and Document-Level Post-Editing

Sai Koneru, Miriam Exel, Matthias Huck, Jan Niehues


Abstract
Large language models (LLMs) have demonstrated considerable success in various natural language processing tasks, but open-source LLMs have yet to attain state-of-the-art performance in Neural Machine Translation (NMT). Nevertheless, their strong performance in tasks that demand broad understanding and contextual processing shows their potential for translation. To exploit these abilities, we investigate using LLMs for MT and explore recent parameter-efficient fine-tuning techniques. Surprisingly, our initial experiments found that fine-tuning with Q-LoRA for translation led to improvements in BLEU but degradation in COMET compared to in-context learning. To overcome this, we propose an alternative approach: adapting LLMs as Automatic Post-Editors (APE) rather than direct translators. Building on the LLM's ability to handle long sequences, we also propose extending our approach to document-level translation. We show that Low-Rank-Adapter fine-tuning for APE yields significant improvements on both sentence- and document-level metrics while generalizing to out-of-domain data. Most notably, we achieve a state-of-the-art accuracy of 88.7% on the ContraPro test set, which assesses a model's ability to resolve pronoun ambiguities when translating from English to German. Lastly, during manual post-editing of a document-level translation, the sentences are corrected iteratively, and these human annotations can be used to refine subsequent translations in the document. Here, we demonstrate that leveraging human corrections significantly reduces the number of edits required for subsequent translations.
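To make the APE adaptation concrete, below is a minimal sketch of what QLoRA fine-tuning of an open LLM as a post-editor could look like, using the Hugging Face transformers and peft libraries. It is not the authors' released code: the base model name, the prompt template, the ape_prompt helper, and all hyperparameters are illustrative assumptions; only the overall recipe (4-bit quantized base model, trainable low-rank adapters, source + draft translation as input, post-edited translation as the training target) follows the approach described in the abstract.

# Minimal sketch (assumed setup, not the paper's code): QLoRA fine-tuning of an
# open LLM as an automatic post-editor with Hugging Face transformers + peft.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

MODEL = "meta-llama/Llama-2-13b-hf"  # assumed base LLM; the paper's choice may differ

# 4-bit quantization (the "Q" in Q-LoRA) keeps the frozen base model small.
bnb = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_compute_dtype=torch.bfloat16)
tok = AutoTokenizer.from_pretrained(MODEL)
model = AutoModelForCausalLM.from_pretrained(MODEL, quantization_config=bnb, device_map="auto")
model = prepare_model_for_kbit_training(model)

# Low-rank adapters are the only trainable parameters; ranks are illustrative.
model = get_peft_model(model, LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05, task_type="CAUSAL_LM",
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
))

def ape_prompt(src: str, draft: str) -> str:
    """Hypothetical prompt template: give the model the source sentence and a
    draft NMT output, and train it to emit the post-edited translation."""
    return (f"English source: {src}\n"
            f"German draft translation: {draft}\n"
            f"Improved German translation:")

# One illustrative training step: the loss target is the post-edited reference.
example = ape_prompt("The cat sat on the mat.", "Die Katze sass auf der Matte.") \
          + " Die Katze saß auf der Matte."
batch = tok(example, return_tensors="pt").to(model.device)
loss = model(**batch, labels=batch["input_ids"]).loss
loss.backward()

For the document-level variant, the same template would additionally carry preceding source sentences and their (possibly human-corrected) translations as context, which is what lets the model exploit iterative human post-edits within a document.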
Anthology ID:
2024.naacl-long.148
Volume:
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
Month:
June
Year:
2024
Address:
Mexico City, Mexico
Editors:
Kevin Duh, Helena Gomez, Steven Bethard
Venue:
NAACL
Publisher:
Association for Computational Linguistics
Pages:
2711–2725
URL:
https://aclanthology.org/2024.naacl-long.148
DOI:
10.18653/v1/2024.naacl-long.148
Cite (ACL):
Sai Koneru, Miriam Exel, Matthias Huck, and Jan Niehues. 2024. Contextual Refinement of Translations: Large Language Models for Sentence and Document-Level Post-Editing. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), pages 2711–2725, Mexico City, Mexico. Association for Computational Linguistics.
Cite (Informal):
Contextual Refinement of Translations: Large Language Models for Sentence and Document-Level Post-Editing (Koneru et al., NAACL 2024)
PDF:
https://preview.aclanthology.org/naacl-24-ws-corrections/2024.naacl-long.148.pdf