Abstract
We present a pilot analysis of a new linguistic resource, VPS-GradeUp (available at http://hdl.handle.net/11234/1-1585). The resource contains 11,400 graded human decisions on usage patterns of 29 English lexical verbs, randomly selected from the Pattern Dictionary of English Verbs (Hanks, 2000 2014) based on their frequency and the number of senses their lemmas have in PDEV. This data set has been created to observe the interannotator agreement on PDEV patterns produced using the Corpus Pattern Analysis (Hanks, 2013). Apart from the graded decisions, the data set also contains traditional Word-Sense-Disambiguation (WSD) labels. We analyze the associations between the graded annotation and WSD annotation. The results of the respective annotations do not correlate with the size of the usage pattern inventory for the respective verbs lemmas, which makes the data set worth further linguistic analysis.- Anthology ID:
- L16-1137
- Volume:
- Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
- Month:
- May
- Year:
- 2016
- Address:
- Portorož, Slovenia
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- 848–854
- Language:
- URL:
- https://aclanthology.org/L16-1137
- DOI:
- Cite (ACL):
- Silvie Cinková, Ema Krejčová, Anna Vernerová, and Vít Baisa. 2016. Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 848–854, Portorož, Slovenia. European Language Resources Association (ELRA).
- Cite (Informal):
- Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study (Cinková et al., LREC 2016)
- PDF:
- https://preview.aclanthology.org/nodalida-main-page/L16-1137.pdf