Annelen Brunner


Annelen Brunner | Stefan Engelberg | Fotis Jannidis | Ngoc Duyen Tanja Tu | Lukas Weimer
Proceedings of the Twelfth Language Resources and Evaluation Conference

This article presents the corpus REDEWIEDERGABE, a German-language historical corpus with detailed annotations for speech, thought and writing representation (ST&WR). With approximately 490,000 tokens, it is the largest resource of its kind. It can be used to answer literary and linguistic research questions and can serve as training material for machine learning. The article describes the composition of the corpus and the annotation structure, discusses some methodological decisions, and gives basic statistics about the forms of ST&WR found in the corpus.


Contexts, Patterns, Interrelations - New Ways of Presenting Multi-word Expressions
Kathrin Steyer | Annelen Brunner
Proceedings of the 10th Workshop on Multiword Expressions (MWE)


Pronominal anaphora resolution in the KANTOO multilingual machine translation system
Teruko Mitamura | Eric Nyberg | Enrique Torrejon | Dave Svoboda | Annelen Brunner | Kathryn Baker
Proceedings of the 9th Conference on Theoretical and Methodological Issues in Machine Translation of Natural Languages: Papers