Éva Csató Johanson


2006

Building a Swedish-Turkish Parallel Corpus
Beáta Bandmann Megyesi | Anna Sågvall Hein | Éva Csató Johanson
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

We present a Swedish-Turkish Parallel Corpus intended for use in linguistic research, teaching, and natural language processing applications, primarily machine translation. The corpus, currently under development, is built using a Basic LAnguage Resource Kit (BLARK) for the two languages, which is then used in the automatic alignment phase to improve alignment accuracy. The corpus is balanced with respect to source and target language and is automatically processed using the Uplug toolkit.