Findings of the WMT 2024 Shared Task of the Open Language Data Initiative
Jean Maillard, Laurie Burchell, Antonios Anastasopoulos, Christian Federmann, Philipp Koehn, Skyler Wang
Abstract
We present the results of the WMT 2024 shared task of the Open Language Data Initiative. Participants were invited to contribute to the FLORES+ and MT Seed multilingual datasets, two foundational open resources that facilitate the organic expansion of language technology’s reach. We accepted ten submissions covering 16 languages, which extended the range of languages included in the datasets and improved the quality of existing data.- Anthology ID:
- 2024.wmt-1.4
- Volume:
- Proceedings of the Ninth Conference on Machine Translation
- Month:
- November
- Year:
- 2024
- Address:
- Miami, Florida, USA
- Editors:
- Barry Haddow, Tom Kocmi, Philipp Koehn, Christof Monz
- Venue:
- WMT
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 110–117
- Language:
- URL:
- https://preview.aclanthology.org/add_missing_videos/2024.wmt-1.4/
- DOI:
- 10.18653/v1/2024.wmt-1.4
- Cite (ACL):
- Jean Maillard, Laurie Burchell, Antonios Anastasopoulos, Christian Federmann, Philipp Koehn, and Skyler Wang. 2024. Findings of the WMT 2024 Shared Task of the Open Language Data Initiative. In Proceedings of the Ninth Conference on Machine Translation, pages 110–117, Miami, Florida, USA. Association for Computational Linguistics.
- Cite (Informal):
- Findings of the WMT 2024 Shared Task of the Open Language Data Initiative (Maillard et al., WMT 2024)
- PDF:
- https://preview.aclanthology.org/add_missing_videos/2024.wmt-1.4.pdf