Heleen Hoekstra


2004

pdf
Linguistic Annotation of the Spoken Dutch Corpus: If We Had To Do It All Over Again
Ineke Schuurman | Wim Goedertier | Heleen Hoekstra | Nelleke Oostdijk | Richard Piepenbrock | Machteld Schouppe
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

After the successful completion of the Spoken Dutch Corpus (1998 -- 2003) the time is ripe to take some time to sit back and reflect on our achievements and the procedures underlying them in order to learn from our experiences. In this paper we will in particular pay attention to issues affecting the levels of linguistic annotation, but some more general issues deserve to be treated as well (bug reporting, consistency). We will try to come up with solutions, but sometimes we want to invite further discussion from other researchers.

2003

pdf
CGN, an annotated corpus of spoken Dutch
Ineke Schuurman | Machteld Schouppe | Heleen Hoekstra | Ton van der Wouden
Proceedings of 4th International Workshop on Linguistically Interpreted Corpora (LINC-03) at EACL 2003

2002

pdf
Syntactic Analysis in the Spoken Dutch Corpus (CGN)
Ton van der Wouden | Heleen Hoekstra | Michael Moortgat | Bram Renmans | Ineke Schuurman
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC’02)

pdf
Annotation of prominent words, prosodic boundaries and segmental lengthening by non-expert transcribers in the Spoken Dutch Corpus
Jeska Buhmann | Johanneke Caspers | Vincent J. van Heuven | Heleen Hoekstra | Jean-Pierre Martens | Marc Swerts
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC’02)