2014
pdf
A Database of Paradigmatic Semantic Relation Pairs for German Nouns, Verbs, and Adjectives
Silke Scheible
|
Sabine Schulte im Walde
Proceedings of Workshop on Lexical and Grammatical Resources for Language Processing
2013
pdf
Uncovering Distributional Differences between Synonyms and Antonyms in a Word Space Model
Silke Scheible
|
Sabine Schulte im Walde
|
Sylvia Springorum
Proceedings of the Sixth International Joint Conference on Natural Language Processing
pdf
The (Un)expected Effects of Applying Standard Cleansing Models to Human Ratings on Compositionality
Stephen Roller
|
Sabine Schulte im Walde
|
Silke Scheible
Proceedings of the 9th Workshop on Multiword Expressions
2012
pdf
abs
GATEtoGerManC: A GATE-based Annotation Pipeline for Historical German
Silke Scheible
|
Richard J. Whitt
|
Martin Durrell
|
Paul Bennett
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
We describe a new GATE-based linguistic annotation pipeline for Early Modern German, which can be used to annotate historical texts with word tokens, sentence boundaries, lemmas, and POS tags. The pipeline is based on a customisation of the freely available ANNIE system for English (Cunningham et al., 2002), in combination with a version of the TreeTagger (Schmid, 1994) trained on gold standard Early Modern German data. The POS-tagging and lemmatisation components of the pipeline achieve an average accuracy of 89.44% and 83.16%, respectively, on unseen historical data from various genres and publication dates within the Early Modern period. We show that normalisation of spelling variation can further improve these results. With no specialised tools available for processing this particular stage of the language, this pipeline will be of particular interest to smaller, humanities-based projects wishing to add linguistic annotations to their historical data but which lack the means or resources to develop such tools themselves.
2011
pdf
A Gold Standard Corpus of Early Modern German
Silke Scheible
|
Richard J. Whitt
|
Martin Durrell
|
Paul Bennett
Proceedings of the 5th Linguistic Annotation Workshop
pdf
Evaluating an ‘off-the-shelf’ POS-tagger on Early Modern German text
Silke Scheible
|
Richard J. Whitt
|
Martin Durrell
|
Paul Bennett
Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities
2008
pdf
abs
Annotating Superlatives
Silke Scheible
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
This paper describes a three-part annotation scheme for superlatives. The first identifies syntactic classes, since superlatives can serve different semantic purposes. The second and third only apply to superlatives that express straight-forward comparisons between targets and their comparison sets. The second form of annotation identifies the spans of each target and comparison set, which is of interest for relation extraction. The third form labels superlatives as facts or opinions, which has not yet been undertaken in the area of sentiment detection. The annotation scheme has been tested and evaluated on 500 tokens of superlatives, the results of which are presented in Section 5. In addition to providing a platform for investigating superlatives on a larger scale, this research also introduces a new text-based Wikipedia corpus which is especially suitable for linguistic research.
2007
pdf
Towards a Computational Treatment of Superlatives
Silke Scheible
Proceedings of the ACL 2007 Student Research Workshop