Towards a principled approach to sense clustering – a case study of wordnet and dictionary senses in Danish

Bolette Pedersen, Manex Agirrezabal, Sanni Nimb, Ida Olsen, Sussi Olsen

[How to correct problems with metadata yourself]


Abstract
Our aim is to develop principled methods for sense clustering which can make existing lexical resources practically useful in NLP – not too fine-grained to be operational and yet finegrained enough to be worth the trouble. Where traditional dictionaries have a highly structured sense inventory typically describing the vocabulary by means of mainand subsenses, wordnets are generally fine-grained and unstructured. We present a series of clustering and annotation experiments with 10 of the most polysemous nouns in Danish. We combine the structured information of a traditional Danish dictionary with the ontological types found in the Danish wordnet, DanNet. This constellation enables us to automatically cluster senses in a principled way and improve inter-annotator agreement and wsd performance.
Anthology ID:
2018.gwc-1.21
Volume:
Proceedings of the 9th Global Wordnet Conference
Month:
January
Year:
2018
Address:
Nanyang Technological University (NTU), Singapore
Editors:
Francis Bond, Piek Vossen, Christiane Fellbaum
Venue:
GWC
SIG:
SIGLEX
Publisher:
Global Wordnet Association
Note:
Pages:
182–189
Language:
URL:
https://aclanthology.org/2018.gwc-1.21
DOI:
Bibkey:
Cite (ACL):
Bolette Pedersen, Manex Agirrezabal, Sanni Nimb, Ida Olsen, and Sussi Olsen. 2018. Towards a principled approach to sense clustering – a case study of wordnet and dictionary senses in Danish. In Proceedings of the 9th Global Wordnet Conference, pages 182–189, Nanyang Technological University (NTU), Singapore. Global Wordnet Association.
Cite (Informal):
Towards a principled approach to sense clustering – a case study of wordnet and dictionary senses in Danish (Pedersen et al., GWC 2018)
Copy Citation:
PDF:
https://preview.aclanthology.org/teach-a-man-to-fish/2018.gwc-1.21.pdf