The Lacunae of Danish Natural Language Processing
Andreas Kirkedal, Barbara Plank, Leon Derczynski, Natalie Schluter
Abstract
Danish is a North Germanic language spoken principally in Denmark, a country with a long tradition of technological and scientific innovation. However, the language has received relatively little attention from a technological perspective. In this paper, we review Natural Language Processing (NLP) research, digital resources and tools which have been developed for Danish. We find that availability of models and tools is limited, which calls for work that lifts Danish NLP a step closer to the privileged languages. Dansk abstrakt: Dansk er et nordgermansk sprog, talt primært i kongeriget Danmark, et land med stærk tradition for teknologisk og videnskabelig innovation. Det danske sprog har imidlertid været genstand for relativt begrænset opmærksomhed, teknologisk set. I denne artikel gennemgår vi sprogteknologi-forskning, -ressourcer og -værktøjer udviklet for dansk. Vi konkluderer at der eksisterer et fåtal af modeller og værktøjer, hvilket indbyder til forskning som løfter dansk sprogteknologi i niveau med mere priviligerede sprog.- Anthology ID:
- W19-6141
- Volume:
- Proceedings of the 22nd Nordic Conference on Computational Linguistics
- Month:
- September–October
- Year:
- 2019
- Address:
- Turku, Finland
- Venue:
- NoDaLiDa
- SIG:
- Publisher:
- Linköping University Electronic Press
- Note:
- Pages:
- 356–362
- Language:
- URL:
- https://aclanthology.org/W19-6141
- DOI:
- Cite (ACL):
- Andreas Kirkedal, Barbara Plank, Leon Derczynski, and Natalie Schluter. 2019. The Lacunae of Danish Natural Language Processing. In Proceedings of the 22nd Nordic Conference on Computational Linguistics, pages 356–362, Turku, Finland. Linköping University Electronic Press.
- Cite (Informal):
- The Lacunae of Danish Natural Language Processing (Kirkedal et al., NoDaLiDa 2019)
- PDF:
- https://preview.aclanthology.org/nodalida-main-page/W19-6141.pdf
- Data
- Universal Dependencies