RoCS-MT v2 at WMT 2025: Robust Challenge Set for Machine Translation

Rachel Bawden; Benoît Sagot

RoCS-MT v2 at WMT 2025: Robust Challenge Set for Machine Translation

Abstract

RoCS-MT (Robust Challenge Set for Machine Translation) was initially proposed at the test suites track of WMT 2023. Designed to challenge MT systems’ translation performance on user-generated content (UGC), it contains examples sourced from English Reddit, with manually normalised versions, aligned labelled annotation spans and reference translations in five languages. In this article, we describe version 2 of RoCS-MT in the context of the 2025 WMT test suites track. This new version contains several improvements on the initial version including (i) minor corrections of normalisation, (ii) corrections to reference translations and addition of alternative references to accommodate for different possible genders (e.g. of speakers) and (iii) a redesign and re-annotation of normalisation spans for further analysis of different non-standard UGC phenomena. We describe these changes and provide results and preliminary analysis of the MT submissions to the 2025 general translation task.

Anthology ID:: 2025.wmt-1.58
Volume:: Proceedings of the Tenth Conference on Machine Translation
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Barry Haddow, Tom Kocmi, Philipp Koehn, Christof Monz
Venue:: WMT
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 834–849
Language:
URL:: https://preview.aclanthology.org/ingest-emnlp/2025.wmt-1.58/
DOI:
Bibkey:
Cite (ACL):: Rachel Bawden and Benoît Sagot. 2025. RoCS-MT v2 at WMT 2025: Robust Challenge Set for Machine Translation. In Proceedings of the Tenth Conference on Machine Translation, pages 834–849, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: RoCS-MT v2 at WMT 2025: Robust Challenge Set for Machine Translation (Bawden & Sagot, WMT 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-emnlp/2025.wmt-1.58.pdf

PDF Cite Search Fix data