Olga Lyashevskaya


Constructing a Lexical Resource of Russian Derivational Morphology
Lukáš Kyjánek | Olga Lyashevskaya | Anna Nedoluzhko | Daniil Vodolazsky | Zdeněk Žabokrtský
Proceedings of the Thirteenth Language Resources and Evaluation Conference

Words of any language are to some extent related thought the ways they are formed. For instance, the verb ‘exempl-ify’ and the noun ‘example-s’ are both based on the word ‘example’, but the verb is derived from it, while the noun is inflected. In Natural Language Processing of Russian, the inflection is satisfactorily processed; however, there are only a few machine-trackable resources that capture derivations even though Russian has both of these morphological processes very rich. Therefore, we devote this paper to improving one of the methods of constructing such resources and to the application of the method to a Russian lexicon, which results in the creation of the largest lexical resource of Russian derivational relations. The resulting database dubbed DeriNet.RU includes more than 300 thousand lexemes connected with more than 164 thousand binary derivational relations. To create such data, we combined the existing machine-learning methods that we improved to manage this goal. The whole approach is evaluated on our newly created data set of manual, parallel annotation. The resulting DeriNet.RU is freely available under an open license agreement.


REALEC learner treebank: annotation principles and evaluation of automatic parsing
Olga Lyashevskaya | Irina Panteleeva
Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories


Learning Computational Linguistics through NLP Evaluation Events: the experience of Russian evaluation initiative
Anastasia Bonch-Osmolovskaya | Svetlana Toldova | Olga Lyashevskaya
Proceedings of the Fourth Workshop on Teaching NLP and CL


RU-EVAL-2012: Evaluating Dependency Parsers for Russian
Anastasia Gareyshina | Maxim Ionov | Olga Lyashevskaya | Dmitry Privoznov | Elena Sokolova | Svetlana Toldova
Proceedings of COLING 2012: Posters


Bank of Russian Constructions and Valencies
Olga Lyashevskaya
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

The Bank of Russian Constructions and Valencies (Russian FrameBank) is an annotation project that takes as input samples from the Russian National Corpus (http://www.ruscorpora.ru). Since Russian verbs and predicates from other POS classes have their particular and not always predictable case pattern, these words and their argument structures are to be described as lexical constructions. The slots of partially filled phrasal constructions (e.g. vzjal i uexal ‘he suddenly (lit. took and) went away’) are also under analysis. Thus, the notion of construction is understood in the sense of Fillmore’s Construction Grammar and is not limited to that of argument structure of verbs. FrameBank brings together the dictionary of constructions and the annotated collection of examples. Our goal is to mark the set of arguments and adjuncts of a certain construction. The main focus is on realization of the elements in the running text, to facilitate searches through pattern realizations by a certain combination of features. The relevant dataset involves lexical, POS and other morphosyntactic tags, semantic classes, as well as grammatical constructions that introduce or license the use of elements within a given construction.