Ida Olsen


2018

pdf
Towards a principled approach to sense clustering – a case study of wordnet and dictionary senses in Danish
Bolette Pedersen | Manex Agirrezabal | Sanni Nimb | Ida Olsen | Sussi Olsen
Proceedings of the 9th Global Wordnet Conference

Our aim is to develop principled methods for sense clustering which can make existing lexical resources practically useful in NLP – not too fine-grained to be operational and yet finegrained enough to be worth the trouble. Where traditional dictionaries have a highly structured sense inventory typically describing the vocabulary by means of mainand subsenses, wordnets are generally fine-grained and unstructured. We present a series of clustering and annotation experiments with 10 of the most polysemous nouns in Danish. We combine the structured information of a traditional Danish dictionary with the ontological types found in the Danish wordnet, DanNet. This constellation enables us to automatically cluster senses in a principled way and improve inter-annotator agreement and wsd performance.