Benchmarking Abstractive Summarisation: A Dataset of Human-authored Summaries of Norwegian News Articles

Samia Touileb, Vladislav Mikhailov, Marie Ingeborg Kroka, Lilja Øvrelid, Erik Velldal


Abstract
We introduce a dataset of high-quality human-authored summaries of news articles in Norwegian. The dataset is intended for benchmarking of the abstractive summarisation capabilities of generative language models. Each document in the dataset is provided with three different candidate gold-standard summaries written by native Norwegian speakers and all summaries are provided in both of the written variants of Norwegian – Bokmål and Nynorsk. The paper describes details on the data creation effort as well as an evaluation of existing open LLMs for Norwegian on the dataset. We also provide insights from a manual human evaluation, comparing human-authored to model generated summaries. Our results indicate that the dataset provides a challenging LLM benchmark for Norwegian summarisation capabilities.
Anthology ID:
2025.nodalida-1.73
Volume:
Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025)
Month:
march
Year:
2025
Address:
Tallinn, Estonia
Editors:
Richard Johansson, Sara Stymne
Venue:
NoDaLiDa
SIG:
Publisher:
University of Tartu Library
Note:
Pages:
729–738
Language:
URL:
https://preview.aclanthology.org/fix-sig-urls/2025.nodalida-1.73/
DOI:
Bibkey:
Cite (ACL):
Samia Touileb, Vladislav Mikhailov, Marie Ingeborg Kroka, Lilja Øvrelid, and Erik Velldal. 2025. Benchmarking Abstractive Summarisation: A Dataset of Human-authored Summaries of Norwegian News Articles. In Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025), pages 729–738, Tallinn, Estonia. University of Tartu Library.
Cite (Informal):
Benchmarking Abstractive Summarisation: A Dataset of Human-authored Summaries of Norwegian News Articles (Touileb et al., NoDaLiDa 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/fix-sig-urls/2025.nodalida-1.73.pdf