Alignment of Wikidata lexemes and Det Centrale Ordregister

Finn Nielsen


Abstract
Two Danish open-access lexicographic resources have appeared in recent years: lexemes in Wikidata and Det Centrale Ordregister (COR). The lexeme part of Wikidata describes words in different languages, and COR associates an identifier with each distinct form of a Danish lexeme. Here I describe the current state of linking Wikidata lexemes with COR and some of the problems encountered.
Anthology ID:
2023.nodalida-1.38
Volume:
Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)
Month:
May
Year:
2023
Address:
Tórshavn, Faroe Islands
Venue:
NoDaLiDa
Publisher:
University of Tartu Library
Pages:
366–370
URL:
https://aclanthology.org/2023.nodalida-1.38
Cite (ACL):
Finn Nielsen. 2023. Alignment of Wikidata lexemes and Det Centrale Ordregister. In Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa), pages 366–370, Tórshavn, Faroe Islands. University of Tartu Library.
Cite (Informal):
Alignment of Wikidata lexemes and Det Centrale Ordregister (Nielsen, NoDaLiDa 2023)
PDF:
https://preview.aclanthology.org/paclic-22-ingestion/2023.nodalida-1.38.pdf