Abstract
This paper describes the TSU HITS team’s submission system for the WMT’24 general translation task. We focused on exploring the capabilities of discrete diffusion models for the English-to-{Russian, German, Czech, Spanish} translation tasks in the constrained track. Our submission system consists of a set of discrete diffusion models for each language pair. The main advance is using a separate length regression model to determine the length of the output sequence more precisely.- Anthology ID:
- 2024.wmt-1.13
- Volume:
- Proceedings of the Ninth Conference on Machine Translation
- Month:
- November
- Year:
- 2024
- Address:
- Miami, Florida, USA
- Editors:
- Barry Haddow, Tom Kocmi, Philipp Koehn, Christof Monz
- Venue:
- WMT
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 205–209
- Language:
- URL:
- https://aclanthology.org/2024.wmt-1.13
- DOI:
- 10.18653/v1/2024.wmt-1.13
- Cite (ACL):
- Vladimir Mynka and Nikolay Mikhaylovskiy. 2024. TSU HITS’s Submissions to the WMT 2024 General Machine Translation Shared Task. In Proceedings of the Ninth Conference on Machine Translation, pages 205–209, Miami, Florida, USA. Association for Computational Linguistics.
- Cite (Informal):
- TSU HITS’s Submissions to the WMT 2024 General Machine Translation Shared Task (Mynka & Mikhaylovskiy, WMT 2024)
- PDF:
- https://preview.aclanthology.org/dois-2013-emnlp/2024.wmt-1.13.pdf