DanNet2: Extending the coverage of adjectives in DanNet based on thesaurus data (project presentation)

Sanni Nimb, Bolette Pedersen, Sussi Olsen


Abstract
The paper describes work in progress in the DanNet2 project, financed by the Carlsberg Foundation. The project's aim is to extend the original Danish wordnet, DanNet, in several ways. The main focus is on extending the coverage and description of adjectives, a part of speech that was rather sparsely described in the original wordnet. We describe the methodology and initial work of semi-automatically transferring adjectives from the Danish Thesaurus to the wordnet, with the aim of easily enlarging the coverage from 3,000 to approx. 13,000 adjectival synsets. Transfer is performed by manually encoding all missing adjectival subsection headwords from the thesaurus and thereafter employing a semi-automatic procedure in which adjectives from the same subsection are transferred to the wordnet as either 1) near synonyms of the section’s headword, 2) hyponyms of the section’s headword, or 3) members of the same synset as the headword. We also discuss how to deal with the problem of multiple representations of the same sense in the thesaurus, and present other types of information from the thesaurus that we plan to integrate, such as thematic and sentiment information.
Anthology ID:
2021.gwc-1.31
Volume:
Proceedings of the 11th Global Wordnet Conference
Month:
January
Year:
2021
Address:
University of South Africa (UNISA)
Editors:
Piek Vossen, Christiane Fellbaum
Venue:
GWC
SIG:
SIGLEX
Publisher:
Global Wordnet Association
Pages:
267–272
URL:
https://aclanthology.org/2021.gwc-1.31
Cite (ACL):
Sanni Nimb, Bolette Pedersen, and Sussi Olsen. 2021. DanNet2: Extending the coverage of adjectives in DanNet based on thesaurus data (project presentation). In Proceedings of the 11th Global Wordnet Conference, pages 267–272, University of South Africa (UNISA). Global Wordnet Association.
Cite (Informal):
DanNet2: Extending the coverage of adjectives in DanNet based on thesaurus data (project presentation) (Nimb et al., GWC 2021)
PDF:
https://preview.aclanthology.org/naacl-24-ws-corrections/2021.gwc-1.31.pdf