Evie Coussé
2020
The EDGeS Diachronic Bible Corpus
Gerlof Bouma
|
Evie Coussé
|
Trude Dijkstra
|
Nicoline van der Sijs
Proceedings of the Twelfth Language Resources and Evaluation Conference
We present the EDGeS Diachronic Bible Corpus: a diachronically and synchronically parallel corpus of Bible translations in Dutch, English, German and Swedish, with texts from the 14th century until today. It is compiled in the context of an intended longitudinal and contrastive study of complex verb constructions in Germanic. The paper discusses the corpus design principles, its selection of 36 Bibles, and the information and metadata encoded for the corpus texts. The EDGeS corpus will be available in two forms: the whole corpus will be accessible for researchers behind a login in the well-known OPUS search infrastructure, and the open subpart of the corpus will be available for download.
2006
Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus
Evie Coussé
|
Steven Gillis
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)
In this paper, we assess an aspect of the quality of the broad phonetic transcriptions in the Spoken Dutch Corpus (CGN). The corpus contains speech from native speakers of Dutch originating from The Netherlands and the Dutch speaking part of Belgium. The phonetic transcriptions were made by transcribers from both regions. In previous research, we have identified regional differences in the transcribers' behaviour. In this paper, we explore the precise sources of the regional bias in the CGN transcriptions and we evaluate its impact on the phonetic transcriptions. More specifically, (1) the regional bias in the canonical transcriptions that served as the basis for the verification task of the transcribers is critically analysed, and (2) we verify in an experiment the regional bias introduced by the transcribers themselves. The possible effects of this inherent regional bias in the CGN transcriptions on subsequent linguistic analyses are briefly discussed.
2004
The Influence of the Labeller’s Regional Background on Phonetic Transcriptions: Implications for the Evaluation of Spoken Language Resources
Evie Coussé
|
Steven Gillis
|
Hanne Kloots
|
Marc Swerts
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)
Search
Co-authors
- Steven Gillis 2
- Gerlof Bouma 1
- Trude Dijkstra 1
- Nicoline van der Sijs 1
- Hanne Kloots 1
- show all...
Venues
- lrec3