Learning to Translate by Translating: Stabilizing the Dual Loop via Semantic-Aware Self-Evolution

Kui Liu, Mingming Yin, Zuoli Tang, Zihao Li, Chilin Fu, Xiaolu Zhang, Jun Zhou, Lixin Zou, Chenliang Li


Abstract
Despite the remarkable success of Large Language Models (LLMs) in Machine Translation (MT), the scarcity of high-quality parallel corpora and the prohibitive cost of acquiring them constrain scalability. To address this, we propose Learning to Translate by Translating (LTT), an LLM-driven dual-learning framework that enables autonomous translation and achieves an 80.42% performance improvement over the base model. By adapting the cycle-consistency principle to the generative paradigm, LTT eliminates the need for parallel data. It employs a robust semantic-aware reward function that balances adequacy with reconstruction fidelity, effectively mitigating the reward hacking issues inherent in traditional unsupervised MT. Relying solely on monolingual data, our 8B model consistently outperforms significantly larger models (70B+) in low-resource settings and achieves parity with state-of-the-art supervised baselines on mainstream benchmarks. LTT thus offers a scalable, data-efficient paradigm for autonomous machine translation.
Anthology ID:
2026.findings-acl.723
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
14711–14726
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.723/
Cite (ACL):
Kui Liu, Mingming Yin, Zuoli Tang, Zihao Li, Chilin Fu, Xiaolu Zhang, Jun Zhou, Lixin Zou, and Chenliang Li. 2026. Learning to Translate by Translating: Stabilizing the Dual Loop via Semantic-Aware Self-Evolution. In Findings of the Association for Computational Linguistics: ACL 2026, pages 14711–14726, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Learning to Translate by Translating: Stabilizing the Dual Loop via Semantic-Aware Self-Evolution (Liu et al., Findings 2026)
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.723.pdf
Checklist:
 2026.findings-acl.723.checklist.pdf