Kuralay Mukhsina


2019

Detecting Collocations Similarity via Logical-Linguistic Model
Nina Khairova | Svitlana Petrasova | Orken Mamyrbayev | Kuralay Mukhsina
RELATIONS - Workshop on meaning relations between phrases and sentences

Semantic similarity between collocations, along with word similarity, is one of the main issues in NLP; it must be addressed, in particular, to facilitate automatic thesaurus generation. In this paper, we consider a logical-linguistic model that defines the relation of semantic similarity between collocations via logical-algebraic equations. We provide the model for English, Ukrainian and Russian text corpora. The implementation for each language differs slightly in the equations of the algebra of finite predicates and in the linguistic resources used. As datasets for our experiment, we use the 5,801 sentence pairs of the Microsoft Research Paraphrase Corpus for English and more than 1,000 texts of scientific papers for Russian and Ukrainian.