Creating Domain Dependent Turkish WordNet and SentiNet
Bilge Nas Arıcan, Merve Özçelik, Deniz Baran Aslan, Elif Sarmış, Selen Parlar, Olcay Taner Yıldız
Abstract
A WordNet is a thesaurus: a structured list of words organized by meaning. It represents word senses (all the meanings a single lemma may have), the relations between these senses, and their definitions. Sentiment analysis is another area of study within Natural Language Processing; it scores data sets according to the emotion they contain. Using data drawn from the Tourism WordNet, we carried out a domain-specific sentiment analysis study by annotating that data. In this paper, we propose a method to facilitate Natural Language Processing tasks such as sentiment analysis in specific domains by creating a domain-specific subset of an original Turkish dictionary. As a preliminary study, we have created a WordNet for the tourism domain with 14,000 words and validated it on simple tasks.
- Anthology ID: 2021.gwc-1.28
- Volume: Proceedings of the 11th Global Wordnet Conference
- Month: January
- Year: 2021
- Address: University of South Africa (UNISA)
- Editors: Piek Vossen, Christiane Fellbaum
- Venue: GWC
- SIG: SIGLEX
- Publisher: Global Wordnet Association
- Pages: 243–251
- URL: https://aclanthology.org/2021.gwc-1.28
- Cite (ACL): Bilge Nas Arıcan, Merve Özçelik, Deniz Baran Aslan, Elif Sarmış, Selen Parlar, and Olcay Taner Yıldız. 2021. Creating Domain Dependent Turkish WordNet and SentiNet. In Proceedings of the 11th Global Wordnet Conference, pages 243–251, University of South Africa (UNISA). Global Wordnet Association.
- Cite (Informal): Creating Domain Dependent Turkish WordNet and SentiNet (Arıcan et al., GWC 2021)
- PDF: https://preview.aclanthology.org/naacl-24-ws-corrections/2021.gwc-1.28.pdf