Stefan Klatt


Fixing paper assignments

  1. Please select all papers that belong to the same person.
  2. Indicate below which author they should be assigned to.
Provide a valid ORCID iD here. This will be used to match future papers to this author.
Provide the name of the school or the university where the author has received or will receive their highest degree (e.g., Ph.D. institution for researchers, or current affiliation for students). This will be used to form the new author page ID, if needed.

TODO: "submit" and "cancel" buttons here


2006

pdf bib
A Corpus-based Approach to the Interpretation of Unknown Words with an Application to German
Stefan Klatt
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

Usually a high portion of the different word forms in a corpusreceive no reading by the lexical and/or morphological analysis. These unknown words constitute a huge problem for NLP analysis tasks likePOS-tagging or syntactic parsing. We present a parameterizable (in principle language-independent) corpus-basedapproach for the interpretation of unknown words that only needs a tokenizedcorpus and can be used in both offline and online applications. In combination with a few linguistic (language-dependent) rules unknown verbs, adjectives, nouns, multiword units etc. are identified. Depending on the recognized word class(es), more detailed morphosyntactic and semantic information is additionally identified in opposite to the majority ofother unknown word guessing methods,which only uses a very narrow decision window to assign an unknown wordits correct reading respective Part-of-Speech tag in a given text. We tested our approach by experiments with German data and received very promising results.

2004

pdf bib
A High Quality Partial Parser for Annotating German Text Corpora
Stefan Klatt
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

Search
Co-authors
    Venues
    Fix data