Abstract
This paper introduces language processing resources and tools for Bornholmsk, a language spoken on the island of Bornholm, with roots in Danish and closely related to Scanian. This presents an overview of the language and available data, and the first NLP models for this living, minority Nordic language. Sammenfattnijng på borrijnholmst: Dæjnna artikkelijn introduserer natursprågsresurser å varktoi for borrijnholmst, ed språg a dær snakkes på ön Borrijnholm me rødder i danst å i nær familia me skånst. Artikkelijn gjer ed âuersyn âuer språged å di datan som fijnnes, å di fosste NLP modællarna for dætta læwenes nordiska minnretâlsspråged.- Anthology ID:
- W19-6138
- Volume:
- Proceedings of the 22nd Nordic Conference on Computational Linguistics
- Month:
- September–October
- Year:
- 2019
- Address:
- Turku, Finland
- Editors:
- Mareike Hartmann, Barbara Plank
- Venue:
- NoDaLiDa
- SIG:
- Publisher:
- Linköping University Electronic Press
- Note:
- Pages:
- 338–344
- Language:
- URL:
- https://aclanthology.org/W19-6138
- DOI:
- Cite (ACL):
- Leon Derczynski and Alex Speed Kjeldsen. 2019. Bornholmsk Natural Language Processing: Resources and Tools. In Proceedings of the 22nd Nordic Conference on Computational Linguistics, pages 338–344, Turku, Finland. Linköping University Electronic Press.
- Cite (Informal):
- Bornholmsk Natural Language Processing: Resources and Tools (Derczynski & Kjeldsen, NoDaLiDa 2019)
- PDF:
- https://preview.aclanthology.org/improve-issue-templates/W19-6138.pdf
- Code
- StrombergNLP/bornholmsk
- Data
- Bornholmsk, Universal Dependencies