Negation in Norwegian: an annotated dataset
Petter Mæhlum, Jeremy Barnes, Robin Kurtz, Lilja Øvrelid, Erik Velldal
Abstract
This paper introduces NorecNeg – the first annotated dataset of negation for Norwegian. Negation cues and their in-sentence scopes have been annotated across more than 11K sentences spanning more than 400 documents for a subset of the Norwegian Review Corpus (NoReC). In addition to providing in-depth discussion of the annotation guidelines, we also present a first set of benchmark results based on a graph-parsing approach.
- Anthology ID:
- 2021.nodalida-main.30
- Volume:
- Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa)
- Month:
- 31 May–2 June
- Year:
- 2021
- Address:
- Reykjavik, Iceland (Online)
- Venue:
- NoDaLiDa
- Publisher:
- Linköping University Electronic Press, Sweden
- Pages:
- 299–308
- URL:
- https://aclanthology.org/2021.nodalida-main.30
- Cite (ACL):
- Petter Mæhlum, Jeremy Barnes, Robin Kurtz, Lilja Øvrelid, and Erik Velldal. 2021. Negation in Norwegian: an annotated dataset. In Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa), pages 299–308, Reykjavik, Iceland (Online). Linköping University Electronic Press, Sweden.
- Cite (Informal):
- Negation in Norwegian: an annotated dataset (Mæhlum et al., NoDaLiDa 2021)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/2021.nodalida-main.30.pdf
- Code:
- ltgoslo/norec_neg
- Data:
- NoReC, NoReC_fine