Hanne Jansen


2006

pdf
The MULINCO corpus and corpus platform
Bente Maegaard | Lene Offersgaard | Lina Henriksen | Hanne Jansen | Xavier Lepetit | Costanza Navarretta | Claus Povlsen
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

The MULINCO project (MUltiLINgual Corpus of the University of Copenhagen) started early 2005. The purpose of this cross-disciplinary project is to create a corpus platform for education and research in monolingual and translation studies. The project covers two main types of corpus texts: literary and non-literary. The platform is being developed using available tools as far as possible, and integrating them in a very open architecture. In this paper we describe the current status and future developments of both the text and tool side of the corpus platform, and we show some examples of student exercises taking advantage of tagged and aligned texts.