Quantifying Language Disparities in Multilingual Large Language Models

Songbo Hu; Ivan Vulić; Anna Korhonen

Quantifying Language Disparities in Multilingual Large Language Models

Abstract

Results reported in large-scale multilingual evaluations are often fragmented and confounded by factors such as target languages, differences in experimental setups, and model choices. We propose a framework that disentangles these confounding variables and introduces three interpretable metrics—the performance realisation ratio, its coefficient of variation, and language potential—enabling a finer-grained and more insightful quantification of actual performance disparities across both (i) models and (ii) languages. Through a case study of 13 model variants on 11 multilingual datasets, we demonstrate that our framework provides a more reliable measurement of model performance and language disparities, particularly for low-resource languages, which have so far proven challenging to evaluate. Importantly, our results reveal that higher overall model performance does not necessarily imply greater fairness across languages.

Anthology ID:: 2025.emnlp-main.199
Volume:: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 4003–4018
Language:
URL:: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.199/
DOI:
Bibkey:
Cite (ACL):: Songbo Hu, Ivan Vulić, and Anna Korhonen. 2025. Quantifying Language Disparities in Multilingual Large Language Models. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 4003–4018, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: Quantifying Language Disparities in Multilingual Large Language Models (Hu et al., EMNLP 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.199.pdf
Checklist:: 2025.emnlp-main.199.checklist.pdf

PDF Cite Search Checklist Fix data