Abstract
We present Charles University submissions to the WMT 22 GeneralTranslation Shared Task on Czech-Ukrainian and Ukrainian-Czech machine translation. We present two constrained submissions based on block back-translation and tagged back-translation and experiment with rule-basedromanization of Ukrainian. Our results show that the romanization onlyhas a minor effect on the translation quality.Further, we describe Charles Translator,a system that was developed in March 2022 as a response to the migrationfrom Ukraine to the Czech Republic. Compared to our constrained systems,it did not use the romanization and used some proprietary data sources.- Anthology ID:
- 2022.wmt-1.30
- Volume:
- Proceedings of the Seventh Conference on Machine Translation (WMT)
- Month:
- December
- Year:
- 2022
- Address:
- Abu Dhabi, United Arab Emirates (Hybrid)
- Venue:
- WMT
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 352–357
- Language:
- URL:
- https://aclanthology.org/2022.wmt-1.30
- DOI:
- Cite (ACL):
- Martin Popel, Jindřich Libovický, and Jindřich Helcl. 2022. CUNI Systems for the WMT 22 Czech-Ukrainian Translation Task. In Proceedings of the Seventh Conference on Machine Translation (WMT), pages 352–357, Abu Dhabi, United Arab Emirates (Hybrid). Association for Computational Linguistics.
- Cite (Informal):
- CUNI Systems for the WMT 22 Czech-Ukrainian Translation Task (Popel et al., WMT 2022)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/2022.wmt-1.30.pdf