Hanne Eckhoff


2020

pdf
A Diachronic Treebank of Russian Spanning More Than a Thousand Years
Aleksandrs Berdicevskis | Hanne Eckhoff
Proceedings of the Twelfth Language Resources and Evaluation Conference

We describe the Tromsø Old Russian and Old Church Slavonic Treebank (TOROT) that spans from the earliest Old Church Slavonic to modern Russian texts, covering more than a thousand years of continuous language history. We focus on the latest additions to the treebank, first of all, the modern subcorpus that was created by a high-quality conversion of the existing treebank of contemporary standard Russian (SynTagRus).