Improving Long-Context Translation via Self-Supervised Dual Learning

Shanbo Cheng, Shuaijie She, Yu Bao, Jianbing Zhang, Jiajun Chen, Shujian Huang


Abstract
Large language models (LLMs) with long context windows offer the potential to translate entire documents in a single pass, yet they frequently suffer from catastrophic information distortion, undermining the strict faithfulness required for translation. This challenge is compounded by the scarcity of document-level parallel data, which makes both supervised fine-tuning and reliable evaluation prohibitively expensive. We propose LongDu, a self-supervised post-training framework that improves long-document translation reliability via round-trip consistency. Given monolingual documents, LongDu samples multiple candidate translations, back-translates each candidate, and optimizes the model to prefer translations that best reconstruct the source. To make this signal robust for long-form generation, we design a reward that filters trivial failure modes (e.g., copying and local language drift) before applying a reconstruction and fluency score, enabling stable reinforcement learning without human annotations. We additionally introduce Long-CIRT, an automatic evaluation protocol that quantifies information distortion by measuring how much a LLM’s performance degrades after a translation cycle. Across multiple base models, LongDu substantially improves information retention and translation quality, with gains that generalize beyond the training length range and to unseen target languages.
Anthology ID:
2026.acl-long.1257
Volume:
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
27269–27280
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.acl-long.1257/
DOI:
Bibkey:
Cite (ACL):
Shanbo Cheng, Shuaijie She, Yu Bao, Jianbing Zhang, Jiajun Chen, and Shujian Huang. 2026. Improving Long-Context Translation via Self-Supervised Dual Learning. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 27269–27280, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Improving Long-Context Translation via Self-Supervised Dual Learning (Cheng et al., ACL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.acl-long.1257.pdf
Checklist:
 2026.acl-long.1257.checklist.pdf