Abstract
This paper explores the application of plWordNet, a very large wordnet of Polish, to weakly supervised Word Sense Disambiguation (WSD). Because plWordNet describes its senses only partially with glosses and usage examples, and its glosses are not sense-disambiguated, PageRank-based WSD methods perform slightly worse on Polish than on English. However, we show that weighting the relation types, and re-ranking senses by the order in which lexical units were added to the wordnet, can significantly improve WSD precision. The evaluation was carried out on two Polish corpora with manual sense annotation, KPWr and Składnica. We discuss the fundamental differences in how the two corpora were constructed and the very different test results obtained on them.
- Anthology ID:
- 2016.gwc-1.41
- Volume:
- Proceedings of the 8th Global WordNet Conference (GWC)
- Month:
- 27–30 January
- Year:
- 2016
- Address:
- Bucharest, Romania
- Editors:
- Christiane Fellbaum, Piek Vossen, Verginica Barbu Mititelu, Corina Forascu
- Venue:
- GWC
- SIG:
- SIGLEX
- Publisher:
- Global Wordnet Association
- Pages:
- 282–291
- URL:
- https://aclanthology.org/2016.gwc-1.41
- Cite (ACL):
- Maciej Piasecki, Paweł Kędzia, and Marlena Orlińska. 2016. plWordNet in Word Sense Disambiguation task. In Proceedings of the 8th Global WordNet Conference (GWC), pages 282–291, Bucharest, Romania. Global Wordnet Association.
- Cite (Informal):
- plWordNet in Word Sense Disambiguation task (Piasecki et al., GWC 2016)
- PDF:
- https://preview.aclanthology.org/ml4al-ingestion/2016.gwc-1.41.pdf