Learning Thesaurus Relations from Distributional Features
Rosa Tsegaye Aga, Christian Wartena, Lucas Drumond, Lars Schmidt-Thieme
Abstract
In distributional semantics words are represented by aggregated context features. The similarity of words can be computed by comparing their feature vectors. Thus, we can predict whether two words are synonymous or similar with respect to some other semantic relation. We will show on six different datasets of pairs of similar and non-similar words that a supervised learning algorithm on feature vectors representing pairs of words outperforms cosine similarity between vectors representing single words. We compared different methods to construct a feature vector representing a pair of words. We show that simple methods like pairwise addition or multiplication give better results than a recently proposed method that combines different types of features. The semantic relation we consider is relatedness of terms in thesauri for intellectual document classification. Thus our findings can directly be applied for the maintenance and extension of such thesauri. To the best of our knowledge this relation was not considered before in the field of distributional semantics.- Anthology ID:
- L16-1328
- Volume:
- Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
- Month:
- May
- Year:
- 2016
- Address:
- Portorož, Slovenia
- Editors:
- Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- 2071–2075
- Language:
- URL:
- https://aclanthology.org/L16-1328
- DOI:
- Cite (ACL):
- Rosa Tsegaye Aga, Christian Wartena, Lucas Drumond, and Lars Schmidt-Thieme. 2016. Learning Thesaurus Relations from Distributional Features. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 2071–2075, Portorož, Slovenia. European Language Resources Association (ELRA).
- Cite (Informal):
- Learning Thesaurus Relations from Distributional Features (Aga et al., LREC 2016)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-3/L16-1328.pdf