Abstract
The paper reports on an effort to reconsider the representation of some cases of derivational paradigm patterns in Bulgarian. The new treatment implemented within BulTreeBank-WordNet (BTB-WN), a wordnet for Bulgarian, is the grouping together of related words that have a common main meaning in the same synset while the nuances in sense are to be encoded within the synset as a modification functions over the main meaning. In this way, we can solve the following challenges: (1) to avoid the influence of English Wordnet (EWN) synset distinctions over Bulgarian that was a result from the translation of some of the synsets from Core WordNet; (2) to represent the common meaning of such derivation patterns just once and to improve the management of BTB-WN, and (3) to encode idiosyncratic usages locally to the corresponding synsets instead of introducing new semantic relations.- Anthology ID:
- 2021.ranlp-srw.21
- Volume:
- Proceedings of the Student Research Workshop Associated with RANLP 2021
- Month:
- September
- Year:
- 2021
- Address:
- Online
- Venue:
- RANLP
- SIG:
- Publisher:
- INCOMA Ltd.
- Note:
- Pages:
- 154–161
- Language:
- URL:
- https://aclanthology.org/2021.ranlp-srw.21
- DOI:
- Cite (ACL):
- Ivaylo Radev and Zara Kancheva. 2021. Handling synset overgeneration: Sense Merging in BTB-WN. In Proceedings of the Student Research Workshop Associated with RANLP 2021, pages 154–161, Online. INCOMA Ltd..
- Cite (Informal):
- Handling synset overgeneration: Sense Merging in BTB-WN (Radev & Kancheva, RANLP 2021)
- PDF:
- https://preview.aclanthology.org/paclic-22-ingestion/2021.ranlp-srw.21.pdf