Russian-Tatar Socio-Political Thesaurus: Methodology, Challenges, the Status of the Project

Alfiya Galieva, Olga Nevzorova, Dilyara Yakubova


Abstract
This paper discusses the general methodology and important practical aspects of implementing a new bilingual lexical resource – the Russian-Tatar Socio-Political Thesaurus that is being developed on the basis of the Russian RuThes thesaurus format as a hierarchy of concepts viewed as units of thought. Each concept is linked with a set of language expressions (words and collocations) referring to it in texts (text entries). Currently the Russian-Tatar Socio-Political Thesaurus includes 6,000 concepts, while new concepts and text entries are being constantly added to it. The paper outlines main challenges of translating concept names and their text entries into Tatar, and describes ways of reflecting the specificity of the Tatar lexical-semantic system.
Anthology ID:
R17-1034
Volume:
Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017
Month:
September
Year:
2017
Address:
Varna, Bulgaria
Venue:
RANLP
SIG:
Publisher:
INCOMA Ltd.
Note:
Pages:
245–252
Language:
URL:
https://doi.org/10.26615/978-954-452-049-6_034
DOI:
10.26615/978-954-452-049-6_034
Bibkey:
Cite (ACL):
Alfiya Galieva, Olga Nevzorova, and Dilyara Yakubova. 2017. Russian-Tatar Socio-Political Thesaurus: Methodology, Challenges, the Status of the Project. In Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017, pages 245–252, Varna, Bulgaria. INCOMA Ltd..
Cite (Informal):
Russian-Tatar Socio-Political Thesaurus: Methodology, Challenges, the Status of the Project (Galieva et al., RANLP 2017)
Copy Citation:
PDF:
https://doi.org/10.26615/978-954-452-049-6_034