Second-order contexts from lexical substitutes for few-shot learning of word representations

Qianchu Liu, Diana McCarthy, Anna Korhonen


Abstract
There is a growing awareness of the need to handle rare and unseen words in word representation modelling. In this paper, we focus on few-shot learning of emerging concepts that fully exploits only a few available contexts. We introduce a substitute-based context representation technique that can be applied to an existing word embedding space. Previous context-based approaches to modelling unseen words consider only bag-of-words first-order contexts, whereas our method aggregates contexts as second-order substitutes produced by a sequence-aware sentence completion model. We experimented with three tasks that aim to test the modelling of emerging concepts. We found that these tasks place different emphases on first- and second-order contexts, and that our substitute-based method achieves superior performance on naturally occurring contexts from corpora.
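
The abstract describes the approach only at a high level; below is a minimal sketch of how a second-order, substitute-based representation might be assembled from a handful of contexts. This is not the authors' implementation: the masked language model (bert-base-uncased via Hugging Face transformers), the toy embedding space, and the probability-weighted averaging are stand-in assumptions for the paper's sequence-aware sentence completion model and pre-existing embedding space.

# Sketch (assumptions noted above): represent a rare/unseen word by the
# weighted average of embeddings of its lexical substitutes, where the
# substitutes are proposed by a masked LM for each available context.
import numpy as np
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="bert-base-uncased")

# Toy stand-in for an existing pre-trained embedding space.
rng = np.random.default_rng(0)
embedding_space = {w: rng.normal(size=50) for w in
                   ["dog", "cat", "puppy", "animal", "pet", "wolf", "it"]}

def second_order_representation(contexts, top_k=10):
    """Aggregate substitute-based (second-order) contexts into one vector.

    Each context is a sentence with the target word replaced by [MASK].
    Substitutes proposed by the masked LM are looked up in the embedding
    space and averaged, weighted by the LM's probabilities.
    """
    weighted_sum = np.zeros(50)
    total_weight = 0.0
    for sentence in contexts:
        for candidate in fill_mask(sentence, top_k=top_k):
            word = candidate["token_str"].strip()
            if word in embedding_space:
                weighted_sum += candidate["score"] * embedding_space[word]
                total_weight += candidate["score"]
    return weighted_sum / total_weight if total_weight > 0 else weighted_sum

# Few-shot setting: only two contexts mentioning the unseen word (masked out).
contexts = ["The [MASK] barked at the mailman.",
            "She took her [MASK] for a walk every morning."]
vector = second_order_representation(contexts)
print(vector[:5])

In contrast, a first-order baseline would average the embeddings of the surrounding context words themselves; the sketch instead averages over substitutes, which is the distinction the abstract draws between first- and second-order contexts.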
Anthology ID:
S19-1007
Volume:
Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics (*SEM 2019)
Month:
June
Year:
2019
Address:
Minneapolis, Minnesota
Editors:
Rada Mihalcea, Ekaterina Shutova, Lun-Wei Ku, Kilian Evang, Soujanya Poria
Venue:
*SEM
SIGs:
SIGSEM | SIGLEX
Publisher:
Association for Computational Linguistics
Pages:
61–67
URL:
https://aclanthology.org/S19-1007
DOI:
10.18653/v1/S19-1007
Cite (ACL):
Qianchu Liu, Diana McCarthy, and Anna Korhonen. 2019. Second-order contexts from lexical substitutes for few-shot learning of word representations. In Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics (*SEM 2019), pages 61–67, Minneapolis, Minnesota. Association for Computational Linguistics.
Cite (Informal):
Second-order contexts from lexical substitutes for few-shot learning of word representations (Liu et al., *SEM 2019)
PDF:
https://preview.aclanthology.org/ingest-bitext-workshop/S19-1007.pdf