Semantic Annotation Layer in Russian National Corpus: Lexical Classes of Nouns and Adjectives

Olga N. Lashevskaja, Olga Yu. Shemanaeva


Abstract
The paper describes the project held within Russian National Corpus (http://www.ruscorpora.ru). Beside such obligatory constituents of a linguistic corpus as POS (parts of speech) and morphological tagging RNC contains semantic annotation. Six classifications are involved in the tagging: category, taxonomy, mereology, topology, evaluation and derivational classes. The operating of the context semantic rules is shown by applying them to various polysemous nouns and adjectives. Our results demonstrate semantic tags incorporated in the context to be highly effective for WSD.
Anthology ID:
L08-1610
Volume:
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
Month:
May
Year:
2008
Address:
Marrakech, Morocco
Editors:
Nicoletta Calzolari, Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Daniel Tapias
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2008/pdf/849_paper.pdf
DOI:
Bibkey:
Cite (ACL):
Olga N. Lashevskaja and Olga Yu. Shemanaeva. 2008. Semantic Annotation Layer in Russian National Corpus: Lexical Classes of Nouns and Adjectives. In Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08), Marrakech, Morocco. European Language Resources Association (ELRA).
Cite (Informal):
Semantic Annotation Layer in Russian National Corpus: Lexical Classes of Nouns and Adjectives (Lashevskaja & Shemanaeva, LREC 2008)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2008/pdf/849_paper.pdf