UTER: Capturing the Human Touch in Evaluating Morphologically Rich and Low-Resource Languages

Samy Ouzerrout


Abstract
We introduce UTER, a novel automatic translation evaluation metric specifically designed for morphologically complex languages. Unlike traditional TER approaches, UTER incorporates a reordering algorithm and leverages the Sørensen-Dicse similarity measure to better account for morphological variations.Tested on morphologically rich and low resource languages from the WMT22 dataset, such as Finnish, Estonian, Kazakh, and Xhosa, UTER delivers results that align more closely with human direct assessments (DA) and outperforms benchmark metrics, including chrF and METEOR. Furthermore, its effectiveness has also been demonstrated on languages with complex writing systems, such as Chinese and Japanese, showcasing its versatility and robustness.
Anthology ID:
2025.loresmt-1.3
Volume:
Proceedings of the Eighth Workshop on Technologies for Machine Translation of Low-Resource Languages (LoResMT 2025)
Month:
May
Year:
2025
Address:
Albuquerque, New Mexico, U.S.A.
Editors:
Atul Kr. Ojha, Chao-hong Liu, Ekaterina Vylomova, Flammie Pirinen, Jonathan Washington, Nathaniel Oco, Xiaobing Zhao
Venues:
LoResMT | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
16–23
Language:
URL:
https://preview.aclanthology.org/Ingest-2025-COMPUTEL/2025.loresmt-1.3/
DOI:
Bibkey:
Cite (ACL):
Samy Ouzerrout. 2025. UTER: Capturing the Human Touch in Evaluating Morphologically Rich and Low-Resource Languages. In Proceedings of the Eighth Workshop on Technologies for Machine Translation of Low-Resource Languages (LoResMT 2025), pages 16–23, Albuquerque, New Mexico, U.S.A.. Association for Computational Linguistics.
Cite (Informal):
UTER: Capturing the Human Touch in Evaluating Morphologically Rich and Low-Resource Languages (Ouzerrout, LoResMT 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/Ingest-2025-COMPUTEL/2025.loresmt-1.3.pdf