Miroslav Štola
2025
CUNI and Phrase at WMT25 MT Evaluation Task
Miroslav Hrabal
|
Ondrej Glembek
|
Aleš Tamchyna
|
Almut Silja Hildebrand
|
Alan Eckhard
|
Miroslav Štola
|
Sergio Penkale
|
Zuzana Šimečková
|
Ondřej Bojar
|
Alon Lavie
|
Craig Stewart
Proceedings of the Tenth Conference on Machine Translation
This paper describes the joint effort of Phrase a.s. and Charles University’sInstitute of Formal and Applied Linguistics (CUNI/UFAL) on the WMT25Automated Translation Quality Evaluation Systems Shared Task. Both teamsparticipated both in a collaborative and competitive manner, i.e. they eachsubmitted a system of their own as well as a contrastive joint system ensemble.In Task~1, we show that such an ensembling—if chosen in a clever way—canlead to a performance boost. We present the analysis of various kinds ofsystems comprising both “traditional” NN-based approach, as well as differentflavours of LLMs—off-the-shelf commercial models, their fine-tuned versions,but also in-house, custom-trained alternative models. In Tasks~2 and~3 we showPhrase’s approach to tackling the tasks via various GPT models: Error SpanAnnotation via the complete MQM solution using non-reasoning models (includingfine-tuned versions) in Task~2, and using reasoning models in Task~3.
Search
Fix author
Co-authors
- Ondřej Bojar 1
- Alan Eckhard 1
- Ondrej Glembek 1
- Almut Silja Hildebrand 1
- Miroslav Hrabal 1
- show all...
Venues
- wmt1