Maksim Kulaev


2022

pdf
Sense-Annotated Corpus for Russian
Alexander Kirillovich | Natalia Loukachevitch | Maksim Kulaev | Angelina Bolshina | Dmitry Ilvovsky
Proceedings of the Fifth International Conference on Computational Linguistics in Bulgaria (CLIB 2022)

We present a sense-annotated corpus for Russian. The resource was obtained my manually annotating texts from the OpenCorpora corpus, an open corpus for the Russian language, by senses of Russian wordnet RuWordNet. The annotation was used as a test collection for comparing unsupervised (Personalized Pagerank) and pseudo-labeling methods for Russian word sense disambiguation.