@comment{Converted from @article to @inproceedings: this is an LREC 2026
  conference paper (proceedings editors, pages, venue address), so the
  conference name belongs in booktitle, not journal. Dropped the Anthology
  volume slug "main" (not a bibliographic volume). Fixed publisher typo
  ("Resource" -> "Resources").}
@inproceedings{feraru-etal-2026-referenceless,
  title     = {Referenceless Evaluation of Machine Translation Models by Ranking Performance in {Romanian} to {English} Translate-train Settings},
  author    = {Feraru, Mihail and
               Diaconu, Alexandra and
               Alexe, Bogdan Dumitru},
  editor    = {Piperidis, Stelios and
               Bel, N{\'u}ria and
               van den Heuvel, Henk and
               Ide, Nancy and
               Krek, Simon and
               Toral, Antonio},
  booktitle = {International Conference on Language Resources and Evaluation},
  month     = may,
  year      = {2026},
  address   = {Palma de Mallorca, Spain},
  publisher = {ELRA Language Resources Association},
  url       = {https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.693/},
  pages     = {8813--8823},
  abstract  = {We propose a referenceless evaluation method for machine translation (MT) models by assessing their performance in translate-train scenarios across a variety of natural language processing (NLP) tasks. The approach ranks MT systems based on the downstream impact of their translations on independent NLP models trained on translated data, thus eliminating the need for professional ground-truth references. We evaluate four prominent MT tools {---} ChatGPT{~}3.5{~}Turbo, DeepL, Google{~}Translate, and Mistral{~}7B{~}Instruct{~}v0.2 {---} on the Romanian$\rightarrow$English language pair and analyze their influence on text summarization, sentiment analysis, and authorship identification. To further test the generalization and robustness of our method, we extend the evaluation to a cross-modality setup using out-of-domain speech data. In this setting, speech segments are transcribed with Whisper-Large, translated into English, and used in a four-class domain classification task (children{'}s stories, audiobooks, film dialogues, podcasts). Our findings show that translation improves downstream performance for sentiment analysis and summarization, while stylistically rich texts such as poetry or noisy ASR transcriptions suffer degradation. The proposed ranking metric correlates strongly with human judgments and remains sensitive to translation quality even in multimodal pipelines, providing a scalable and practical alternative to reference-based MT evaluation.}
}
@comment{Scraped page footer, preserved for reference: Markdown (Informal)}
@comment{Informal markdown citation from the scraped Anthology page:
  [Referenceless Evaluation of Machine Translation Models by Ranking Performance in Romanian to English Translate-train Settings](https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.693/) (Feraru et al., LREC 2026)
  ACL}