Abstract
We present SeVeN (Semantic Vector Networks), a hybrid resource that encodes relationships between words in the form of a graph. Different from traditional semantic networks, these relations are represented as vectors in a continuous vector space. We propose a simple pipeline for learning such relation vectors, which is based on word vector averaging in combination with an ad hoc autoencoder. We show that by explicitly encoding relational information in a dedicated vector space we can capture aspects of word meaning that are complementary to what is captured by word embeddings. For example, by examining clusters of relation vectors, we observe that relational similarities can be identified at a more abstract level than with traditional word vector differences. Finally, we test the effectiveness of semantic vector networks in two tasks: measuring word similarity and neural text categorization. SeVeN is available at bitbucket.org/luisespinosa/seven.- Anthology ID:
- C18-1225
- Volume:
- Proceedings of the 27th International Conference on Computational Linguistics
- Month:
- August
- Year:
- 2018
- Address:
- Santa Fe, New Mexico, USA
- Editors:
- Emily M. Bender, Leon Derczynski, Pierre Isabelle
- Venue:
- COLING
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 2653–2665
- Language:
- URL:
- https://aclanthology.org/C18-1225
- DOI:
- Cite (ACL):
- Luis Espinosa-Anke and Steven Schockaert. 2018. SeVeN: Augmenting Word Embeddings with Unsupervised Relation Vectors. In Proceedings of the 27th International Conference on Computational Linguistics, pages 2653–2665, Santa Fe, New Mexico, USA. Association for Computational Linguistics.
- Cite (Informal):
- SeVeN: Augmenting Word Embeddings with Unsupervised Relation Vectors (Espinosa-Anke & Schockaert, COLING 2018)
- PDF:
- https://preview.aclanthology.org/fix-dup-bibkey/C18-1225.pdf
- Code
- luisespinosa/seven
- Data
- ConceptNet