Abstract
We introduce NoReC_fine, a dataset for fine-grained sentiment analysis in Norwegian, annotated with respect to polar expressions, targets, and holders of opinion. The underlying texts are taken from a corpus of professionally authored reviews from multiple news sources and across a wide variety of domains, including literature, games, music, products, movies and more. We present a detailed description of this annotation effort. We provide an overview of the developed annotation guidelines, illustrated with examples, and present an analysis of inter-annotator agreement. We also report the first experimental results on the dataset, intended as a preliminary benchmark for further experiments.
- Anthology ID:
- 2020.lrec-1.618
- Volume:
- Proceedings of the Twelfth Language Resources and Evaluation Conference
- Month:
- May
- Year:
- 2020
- Address:
- Marseille, France
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association
- Note:
- Pages:
- 5025–5033
- Language:
- English
- URL:
- https://aclanthology.org/2020.lrec-1.618
- DOI:
- Cite (ACL):
- Lilja Øvrelid, Petter Mæhlum, Jeremy Barnes, and Erik Velldal. 2020. A Fine-grained Sentiment Dataset for Norwegian. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 5025–5033, Marseille, France. European Language Resources Association.
- Cite (Informal):
- A Fine-grained Sentiment Dataset for Norwegian (Øvrelid et al., LREC 2020)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/2020.lrec-1.618.pdf
- Code:
- ltgoslo/norec_fine
- Data:
- NoReC_fine