Scoring the Translation: On Target Automatic Keyword-Based Evaluation of Machine Translation in the Sports Domain

Steinþór Steingrímsson, Einar Sigurdsson


Abstract
We take a closer look at the results of a recent translation shared task at WMT 2025 (the Conference on Machine Translation) and analyse the errors in the output of the four highest-scoring systems. We revise the automatic evaluation method used in Sigurðsson et al. (2025) and compare it to manual evaluation of six machine translation systems. We find that our results are in line with the manual evaluation, indicating that the test suite can be well suited for evaluating machine translation in this domain. Finally, we publish a list of domain-specific sports terms, namely, in the domains of basketball, chess, football, golf and gymnastics.
Anthology ID:
2026.lrec-main.697
Volume:
Proceedings of the Fifteenth Language Resources and Evaluation Conference
Month:
May
Year:
2026
Address:
Palma de Mallorca, Spain
Editors:
Stelios Piperidis, Núria Bel, Henk van den Heuvel, Nancy Ide, Simon Krek, Antonio Toral
Venue:
LREC
SIG:
Publisher:
ELRA Language Resource Association
Note:
Pages:
8862–8871
Language:
URL:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.697/
DOI:
Bibkey:
Cite (ACL):
Steinþór Steingrímsson and Einar Sigurdsson. 2026. Scoring the Translation: On Target Automatic Keyword-Based Evaluation of Machine Translation in the Sports Domain. International Conference on Language Resources and Evaluation, main:8862–8871.
Cite (Informal):
Scoring the Translation: On Target Automatic Keyword-Based Evaluation of Machine Translation in the Sports Domain (Steingrímsson & Sigurdsson, LREC 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.697.pdf