Abstract
Two Danish open access lexicographic resources have appeared in recent years: lexemes in Wikidata and Det Centrale Ordregister (COR). The lexeme part of Wikidata describes words in different languages, and COR associates an identifier with each distinct form of Danish lexemes. Here I describe the current state of linking Wikidata lexemes with COR and some of the problems encountered.
- Anthology ID:
- 2023.nodalida-1.38
- Volume:
- Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)
- Month:
- May
- Year:
- 2023
- Address:
- Tórshavn, Faroe Islands
- Editors:
- Tanel Alumäe, Mark Fishel
- Venue:
- NoDaLiDa
- Publisher:
- University of Tartu Library
- Pages:
- 366–370
- URL:
- https://aclanthology.org/2023.nodalida-1.38
- Cite (ACL):
- Finn Nielsen. 2023. Alignment of Wikidata lexemes and Det Centrale Ordregister. In Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa), pages 366–370, Tórshavn, Faroe Islands. University of Tartu Library.
- Cite (Informal):
- Alignment of Wikidata lexemes and Det Centrale Ordregister (Nielsen, NoDaLiDa 2023)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-4/2023.nodalida-1.38.pdf