Daniel Fernandez Sanchez


2020

Supervised Hypernymy Detection in Spanish through Order Embeddings
Gun Woo Lee | Mathias Etcheverry | Daniel Fernandez Sanchez | Dina Wonsever
Proceedings of the 7th Workshop on Linked Data in Linguistics (LDL-2020)

This paper addresses supervised hypernymy detection in Spanish using an order embedding that takes pretrained word vectors as input. Although the task has been widely studied in English, there is little work on Spanish and, to our knowledge, no available dataset for supervised hypernymy detection in Spanish. We built a supervised hypernymy dataset for Spanish from WordNet and corpus statistics, in different versions according to the lexical intersection between its partitions: a random split and a lexical split. We report the results of training an order embedding on this dataset with pretrained word vectors as input. The results on the lexical split show that pretrained word vectors allow the model to transfer what it learns to lexical units unseen during training. Finally, we study the effect of providing additional information at training time, such as cohyponym links and instances extracted through patterns.
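
As a rough illustration of the order-embedding idea named in the abstract, the sketch below computes an order-violation penalty in the style of the original order-embeddings formulation (Vendrov et al.), where a hypernym pair is penalized by the squared norm of max(0, hypernym - hyponym). It assumes the reversed product order, in which the more general term has coordinates no larger than those of the more specific term. The abstract does not give the paper's exact model or loss, so the function name `order_violation` and the toy vectors are hypothetical and stand in for learned embeddings.

```python
def order_violation(hyponym_vec, hypernym_vec):
    """Return the sum over coordinates of max(0, hypernym_i - hyponym_i) ** 2.

    Assumption: reversed product order, so the penalty is zero exactly when
    every hypernym coordinate is less than or equal to the hyponym coordinate.
    """
    return sum(
        max(0.0, general - specific) ** 2
        for specific, general in zip(hyponym_vec, hypernym_vec)
    )


# Toy vectors (hypothetical values, not learned embeddings).
perro = [2.0, 3.0]   # "dog", the more specific term
animal = [1.0, 1.0]  # "animal", the more general term

# Correct direction: "perro" is a kind of "animal", so no violation.
penalty_correct = order_violation(perro, animal)

# Reversed direction: "animal" is not a kind of "perro", so the penalty is positive.
penalty_reversed = order_violation(animal, perro)

print(penalty_correct, penalty_reversed)
```

In a supervised setting, a penalty of this kind would typically be pushed toward zero for positive pairs and above a margin for negative pairs; the specific training objective is left open here.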