NorBench – A Benchmark for Norwegian Language Models

David Samuel; Andrey Kutuzov; Samia Touileb; Erik Velldal; Lilja Øvrelid; Egil Rønningstad; Elina Sigdel; Anna Palatkina

NorBench – A Benchmark for Norwegian Language Models

David Samuel, Andrey Kutuzov, Samia Touileb, Erik Velldal, Lilja Øvrelid, Egil Rønningstad, Elina Sigdel, Anna Palatkina

Abstract

We present NorBench: a streamlined suite of NLP tasks and probes for evaluating Norwegian language models (LMs) on standardized data splits and evaluation metrics. We also introduce a range of new Norwegian language models (both encoder and encoder-decoder based). Finally, we compare and analyze their performance, along with other existing LMs, across the different benchmark tests of NorBench.

Anthology ID:: 2023.nodalida-1.61
Volume:: Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)
Month:: May
Year:: 2023
Address:: Tórshavn, Faroe Islands
Editors:: Tanel Alumäe, Mark Fishel
Venue:: NoDaLiDa
SIG:
Publisher:: University of Tartu Library
Note:
Pages:: 618–633
Language:
URL:: https://aclanthology.org/2023.nodalida-1.61
DOI:
Bibkey:
Cite (ACL):: David Samuel, Andrey Kutuzov, Samia Touileb, Erik Velldal, Lilja Øvrelid, Egil Rønningstad, Elina Sigdel, and Anna Palatkina. 2023. NorBench – A Benchmark for Norwegian Language Models. In Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa), pages 618–633, Tórshavn, Faroe Islands. University of Tartu Library.
Cite (Informal):: NorBench – A Benchmark for Norwegian Language Models (Samuel et al., NoDaLiDa 2023)
Copy Citation:
PDF:: https://preview.aclanthology.org/landing_page/2023.nodalida-1.61.pdf

PDF Search