Sense-Annotated Corpus for Russian
Alexander Kirillovich, Natalia Loukachevitch, Maksim Kulaev, Angelina Bolshina, Dmitry Ilvovsky
Abstract
We present a sense-annotated corpus for Russian. The resource was obtained by manually annotating texts from OpenCorpora, an open corpus of the Russian language, with senses from the Russian wordnet RuWordNet. The annotation was used as a test collection for comparing unsupervised (Personalized PageRank) and pseudo-labeling methods for Russian word sense disambiguation.
- Anthology ID:
- 2022.clib-1.15
- Volume:
- Proceedings of the 5th International Conference on Computational Linguistics in Bulgaria (CLIB 2022)
- Month:
- September
- Year:
- 2022
- Address:
- Sofia, Bulgaria
- Venue:
- CLIB
- Publisher:
- Department of Computational Linguistics, IBL -- BAS
- Pages:
- 130–136
- URL:
- https://aclanthology.org/2022.clib-1.15
- Cite (ACL):
- Alexander Kirillovich, Natalia Loukachevitch, Maksim Kulaev, Angelina Bolshina, and Dmitry Ilvovsky. 2022. Sense-Annotated Corpus for Russian. In Proceedings of the 5th International Conference on Computational Linguistics in Bulgaria (CLIB 2022), pages 130–136, Sofia, Bulgaria. Department of Computational Linguistics, IBL -- BAS.
- Cite (Informal):
- Sense-Annotated Corpus for Russian (Kirillovich et al., CLIB 2022)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-4/2022.clib-1.15.pdf