Assaf Siani
2026
MTQE.en-he: Machine Translation Quality Estimation for English-Hebrew
Andy Rosenbaum | Assaf Siani | Ilan Kernerman
Proceedings of the Second Workshop on Language Models for Low-Resource Languages (LoResLM 2026)
Andy Rosenbaum | Assaf Siani | Ilan Kernerman
Proceedings of the Second Workshop on Language Models for Low-Resource Languages (LoResLM 2026)
We release MTQE.en-he: to our knowledge,the first publicly available English-Hebrewbenchmark for Machine Translation QualityEstimation. MTQE.en-he contains 959 English segments from WMT24++, each pairedwith a machine translation into Hebrew, andDirect Assessment scores of the translationquality annotated by three human experts. Webenchmark ChatGPT prompting, TransQuest,and CometKiwi and show that ensemblingthe three models outperforms the best singlemodel (CometKiwi) by 6.4 percentage pointsPearson and 5.8 percentage points Spearman.Fine-tuning experiments with TransQuest andCometKiwi reveal that full-model updates aresensitive to overfitting and distribution collapse,yet parameter-efficient methods (LoRA, BitFit, and FTHead, i.e., fine-tuning only the classification head)train stably and yield improvements of 2-3 percentage points. MTQE.en-heand our experimental results enable future research on this under-resourced language pair.