Handling synset overgeneration: Sense Merging in BTB-WN

Ivaylo Radev, Zara Kancheva


Abstract
The paper reports on an effort to reconsider the representation of some cases of derivational paradigm patterns in Bulgarian. The new treatment, implemented within BulTreeBank-WordNet (BTB-WN), a wordnet for Bulgarian, groups related words that share a common main meaning into the same synset, while the nuances in sense are encoded within the synset as modification functions over the main meaning. In this way, we can address the following challenges: (1) avoiding the influence of English Wordnet (EWN) synset distinctions on Bulgarian, which resulted from the translation of some of the synsets from Core WordNet; (2) representing the common meaning of such derivation patterns just once and improving the management of BTB-WN; and (3) encoding idiosyncratic usages locally in the corresponding synsets instead of introducing new semantic relations.
Anthology ID:
2021.ranlp-srw.21
Volume:
Proceedings of the Student Research Workshop Associated with RANLP 2021
Month:
September
Year:
2021
Address:
Online
Venue:
RANLP
Publisher:
INCOMA Ltd.
Pages:
154–161
URL:
https://aclanthology.org/2021.ranlp-srw.21
Cite (ACL):
Ivaylo Radev and Zara Kancheva. 2021. Handling synset overgeneration: Sense Merging in BTB-WN. In Proceedings of the Student Research Workshop Associated with RANLP 2021, pages 154–161, Online. INCOMA Ltd.
Cite (Informal):
Handling synset overgeneration: Sense Merging in BTB-WN (Radev & Kancheva, RANLP 2021)
PDF:
https://preview.aclanthology.org/paclic-22-ingestion/2021.ranlp-srw.21.pdf