LORAXBENCH: A Multitask, Multilingual Benchmark Suite for 20 Indonesian Languages

Alham Fikri Aji, Trevor Cohn


Abstract
As one of the world’s most populous countries, with 700 languages spoken, Indonesia is behind in terms of NLP progress. We introduce LORAXBENCH, a benchmark that focuses on low-resource languages of Indonesia and covers 6 diverse tasks: reading comprehension, open-domain QA, language inference, causal reasoning, translation, and cultural QA. Our dataset cover 20 languages, with the addition of two formality registers for three languages. We evaluate a diverse set of multilingual and region-focused LLMs and found that this benchmark is challenging. We note a visible discrepancy between performance in Indonesian and other languages, especially the low-resource ones. There is no clear lead when using a region-specific model as opposed to the general multilingual model. Lastly, we show that a change in register affects model performance, especially with registers not commonly found in social media, such as high-level politeness ‘Krama’ Javanese.
Anthology ID:
2025.emnlp-main.881
Volume:
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
17432–17457
Language:
URL:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.881/
DOI:
Bibkey:
Cite (ACL):
Alham Fikri Aji and Trevor Cohn. 2025. LORAXBENCH: A Multitask, Multilingual Benchmark Suite for 20 Indonesian Languages. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 17432–17457, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
LORAXBENCH: A Multitask, Multilingual Benchmark Suite for 20 Indonesian Languages (Aji & Cohn, EMNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.881.pdf
Checklist:
 2025.emnlp-main.881.checklist.pdf