2022
pdf
abs
Morpholex Turkish: A Morphological Lexicon for Turkish
Bilge Arican
|
Aslı Kuzgun
|
Büşra Marşan
|
Deniz Baran Aslan
|
Ezgi Saniyar
|
Neslihan Cesur
|
Neslihan Kara
|
Oguzhan Kuyrukcu
|
Merve Ozcelik
|
Arife Betul Yenice
|
Merve Dogan
|
Ceren Oksal
|
Gökhan Ercan
|
Olcay Taner Yıldız
Proceedings of Globalex Workshop on Linked Lexicography within the 13th Language Resources and Evaluation Conference
MorphoLex is a study in which root, prefix and suffixes of words are analyzed. With MorphoLex, many words can be analyzed according to certain rules and a useful database can be created. Due to the fact that Turkish is an agglutinative language and the richness of its language structure, it offers different analyzes and results from previous studies in MorphoLex. In this study, we revealed the process of creating a database with 48,472 words and the results of the differences in language structure.
2021
pdf
abs
Creating Domain Dependent Turkish WordNet and SentiNet
Bilge Nas Arıcan
|
Merve Özçelik
|
Deniz Baran Aslan
|
Elif Sarmış
|
Selen Parlar
|
Olcay Taner Yıldız
Proceedings of the 11th Global Wordnet Conference
A WordNet is a thesaurus that has a structured list of words organized depending on their meanings. WordNet represents word senses, all meanings a single lemma may have, the relations between these senses, and their definitions. Another study within the domain of Natural Language Processing is sentiment analysis. With sentiment analysis, data sets can be scored according to the emotion they contain. In the sentiment analysis we did with the data we received on the Tourism WordNet, we performed a domain-specific sentiment analysis study by annotating the data. In this paper, we propose a method to facilitate Natural Language Processing tasks such as sentiment analysis performed in specific domains via creating a specific-domain subset of an original Turkish dictionary. As the preliminary study, we have created a WordNet for the tourism domain with 14,000 words and validated it on simple tasks.
2020
pdf
abs
TRopBank: Turkish PropBank V2.0
Neslihan Kara
|
Deniz Baran Aslan
|
Büşra Marşan
|
Özge Bakay
|
Koray Ak
|
Olcay Taner Yıldız
Proceedings of the Twelfth Language Resources and Evaluation Conference
In this paper, we present and explain TRopBank “Turkish PropBank v2.0”. PropBank is a hand-annotated corpus of propositions which is used to obtain the predicate-argument information of a language. Predicate-argument information of a language can help understand semantic roles of arguments. “Turkish PropBank v2.0”, unlike PropBank v1.0, has a much more extensive list of Turkish verbs, with 17.673 verbs in total.