Abstract
The paper describes the new Russian sentiment lexicon - RuSentiLex. The lexicon was gathered from several sources: opinionated words from domain-oriented Russian sentiment vocabularies, slang and curse words extracted from Twitter, objective words with positive or negative connotations from a news collection. The words in the lexicon having different sentiment orientations in specific senses are linked to appropriate concepts of the thesaurus of Russian language RuThes. All lexicon entries are classified according to four sentiment categories and three sources of sentiment (opinion, emotion, or fact). The lexicon can serve as the first version for the construction of domain-specific sentiment lexicons or can be used for feature generation in machine-learning approaches. In this role, the RuSentiLex lexicon was utilized by the participants of the SentiRuEval-2016 Twitter reputation monitoring shared task and allowed them to achieve high results.- Anthology ID:
- L16-1186
- Volume:
- Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
- Month:
- May
- Year:
- 2016
- Address:
- Portorož, Slovenia
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- 1171–1176
- Language:
- URL:
- https://aclanthology.org/L16-1186
- DOI:
- Cite (ACL):
- Natalia Loukachevitch and Anatolii Levchik. 2016. Creating a General Russian Sentiment Lexicon. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 1171–1176, Portorož, Slovenia. European Language Resources Association (ELRA).
- Cite (Informal):
- Creating a General Russian Sentiment Lexicon (Loukachevitch & Levchik, LREC 2016)
- PDF:
- https://preview.aclanthology.org/starsem-semeval-split/L16-1186.pdf