Ema Krejčová
2016
VPS-GradeUp: Graded Decisions on Usage Patterns
Vít Baisa
|
Silvie Cinková
|
Ema Krejčová
|
Anna Vernerová
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
We present VPS-GradeUp ― a set of 11,400 graded human decisions on usage patterns of 29 English lexical verbs from the Pattern Dictionary of English Verbs by Patrick Hanks. The annotation contains, for each verb lemma, a batch of 50 concordances with the given lemma as KWIC, and for each of these concordances we provide a graded human decision on how well the individual PDEV patterns for this particular lemma illustrate the given concordance, indicated on a 7-point Likert scale for each PDEV pattern. With our annotation, we were pursuing a pilot investigation of the foundations of human clustering and disambiguation decisions with respect to usage patterns of verbs in context. The data set is publicly available at http://hdl.handle.net/11234/1-1585.
Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study
Silvie Cinková
|
Ema Krejčová
|
Anna Vernerová
|
Vít Baisa
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
We present a pilot analysis of a new linguistic resource, VPS-GradeUp (available at http://hdl.handle.net/11234/1-1585). The resource contains 11,400 graded human decisions on usage patterns of 29 English lexical verbs, randomly selected from the Pattern Dictionary of English Verbs (Hanks, 2000 2014) based on their frequency and the number of senses their lemmas have in PDEV. This data set has been created to observe the interannotator agreement on PDEV patterns produced using the Corpus Pattern Analysis (Hanks, 2013). Apart from the graded decisions, the data set also contains traditional Word-Sense-Disambiguation (WSD) labels. We analyze the associations between the graded annotation and WSD annotation. The results of the respective annotations do not correlate with the size of the usage pattern inventory for the respective verbs lemmas, which makes the data set worth further linguistic analysis.
2013
Rule-Based Extraction of English Verb Collocates from a Dependency-Parsed Corpus
Silvie Cinková
|
Martin Holub
|
Ema Krejčová
|
Lenka Smejkalová
Proceedings of the Second International Conference on Dependency Linguistics (DepLing 2013)
Search