@inproceedings{poelman-de-lhoneux-2026-form,
title = "Form and Meaning in Intrinsic Multilingual Evaluations",
author = "Poelman, Wessel and
de Lhoneux, Miryam",
editor = "Demberg, Vera and
Inui, Kentaro and
Marquez, Llu{\'i}s",
booktitle = "Proceedings of the 19th Conference of the {E}uropean Chapter of the {A}ssociation for {C}omputational {L}inguistics (Volume 1: Long Papers)",
month = mar,
year = "2026",
address = "Rabat, Morocco",
publisher = "Association for Computational Linguistics",
url = "https://preview.aclanthology.org/ingest-eacl/2026.eacl-long.113/",
pages = "2503--2521",
ISBN = "979-8-89176-380-7",
abstract = "Intrinsic evaluation metrics for conditional language models, such as perplexity or bits-per-character, are widely used in both mono- and multilingual settings. These metrics are rather straightforward to use and compare in monolingual setups, but rest on a number of assumptions in multilingual setups. One such assumption is that comparing the perplexity of CLMs on parallel sentences is indicative of their quality since the information content (here understood as the semantic meaning) is the same. However, the metrics are inherently measuring information content in the information-theoretic sense. We make this and other such assumptions explicit and discuss their implications. We perform experiments with six metrics on two multi-parallel corpora both with mono- and multilingual models. Ultimately, we find that current metrics are not universally comparable. We look at the form-meaning debate to provide some explanation for this."
}