A Fine-grained Sentiment Dataset for Norwegian

Lilja Øvrelid, Petter Mæhlum, Jeremy Barnes, Erik Velldal


Abstract
We here introduce NoReC_fine, a dataset for fine-grained sentiment analysis in Norwegian, annotated with respect to polar expressions, targets and holders of opinion. The underlying texts are taken from a corpus of professionally authored reviews from multiple news-sources and across a wide variety of domains, including literature, games, music, products, movies and more. We here present a detailed description of this annotation effort. We provide an overview of the developed annotation guidelines, illustrated with examples and present an analysis of inter-annotator agreement. We also report the first experimental results on the dataset, intended as a preliminary benchmark for further experiments.
Anthology ID:
2020.lrec-1.618
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
5025–5033
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.618
DOI:
Bibkey:
Cite (ACL):
Lilja Øvrelid, Petter Mæhlum, Jeremy Barnes, and Erik Velldal. 2020. A Fine-grained Sentiment Dataset for Norwegian. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 5025–5033, Marseille, France. European Language Resources Association.
Cite (Informal):
A Fine-grained Sentiment Dataset for Norwegian (Øvrelid et al., LREC 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2020.lrec-1.618.pdf
Code
 ltgoslo/norec_fine
Data
NoReC_fine