Harald Lüngen

Also published as: Harald Lungen


2022

Proceedings of the Workshop on Challenges in the Management of Large Corpora (CMLC-10)
Piotr Bański | Adrien Barbaresi | Simon Clematide | Marc Kupietz | Harald Lüngen
Proceedings of the Workshop on Challenges in the Management of Large Corpora (CMLC-10)

2020

Proceedings of the 8th Workshop on Challenges in the Management of Large Corpora
Piotr Bański | Adrien Barbaresi | Simon Clematide | Marc Kupietz | Harald Lüngen | Ines Pisetta
Proceedings of the 8th Workshop on Challenges in the Management of Large Corpora

2018

The German Reference Corpus DeReKo: New Developments – New Opportunities
Marc Kupietz | Harald Lüngen | Paweł Kamocki | Andreas Witt
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

2014

Recent Developments in DeReKo
Marc Kupietz | Harald Lüngen
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)

This paper gives an overview of recent developments in the German Reference Corpus DeReKo in terms of growth, maximising relevant corpus strata, metadata, legal issues, and its current and future research interface. Due to the recent acquisition of new licenses, DeReKo has grown by a factor of four in the first half of 2014, mostly in the area of newspaper text, and presently contains over 24 billion word tokens. Other strata, like fictional texts, web corpora, in particular CMC texts, and spoken but conceptually written texts have also increased significantly. We report on the newly acquired corpora that led to the major increase, on the principles and strategies behind our corpus acquisition activities, and on our solutions for the emerging legal, organisational, and technical challenges.

2004

Text Type Structure and Logical Document Structure
Hagen Langer | Harald Lungen | Petra Saskia Bayerl
Proceedings of the Workshop on Discourse Annotation

2000

Enhancing Speech Corpus Resources with Multiple Lexical Tag Layers
Andreas Witt | Harald Lüngen | Dafydd Gibbon
Proceedings of the Second International Conference on Language Resources and Evaluation (LREC'00)