Mikus Grasmanis
2025
Towards a Derivational Semantics Resource for Latvian
Ilze Lokmane
|
Mikus Grasmanis
|
Agute Klints
|
Gunta Nešpore-Bērzkalne
|
Pēteris Paikens
|
Lauma Pretkalniņa
|
Laura Rituma
|
Madara Stāde
|
Evelīna Tauriņa
Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025)
In this paper we describe the implementation of the first structured resource of semantic derivational links for Latvian, basing it on the largest online dictionary Tēzaurs.lv and linking it to the Latvian WordNet. We separate two kinds of derivational links: semantic derivation links between senses and morphological derivation links between lexemes. The semantic links between senses are defined as a pair of semantic labels assigned to both ends of the link. The process of semantic linking involves revising the sense inventory of both the base word and the derivative, defining semantic labels for lexemes of four basic word classes – nouns, verbs, adjectives and adverbs, and adding the appropriate labels to the corresponding senses. We exemplify our findings with a detailed representation of sense relations between a base verb and its nominal derivatives.
2022
Towards Latvian WordNet
Peteris Paikens
|
Mikus Grasmanis
|
Agute Klints
|
Ilze Lokmane
|
Lauma Pretkalniņa
|
Laura Rituma
|
Madara Stāde
|
Laine Strankale
Proceedings of the Thirteenth Language Resources and Evaluation Conference
In this paper we describe our current work on creating a WordNet for Latvian based on the principles of the Princeton WordNet. The chosen methodology for word sense definition and sense linking is based on corpus evidence and the existing Tezaurs.lv online dictionary, ensuring a foundation that fits the Latvian language usage and existing linguistic tradition. We cover a wide set of semantic relations, including gradation sets. Currently the dataset consists of 6432 words linked in 5528 synsets, out of which 2717 synsets are considered fully completed as they have all the outgoing semantic links annotated, annotated with corpus examples for each sense and links to the English Princeton WordNet.
Search
Fix data
Co-authors
- Agute Klints 2
- Ilze Lokmane 2
- Peteris Paikens 2
- Lauma Pretkalniņa 2
- Laura Rituma 2
- show all...