Representing ELMo embeddings as two-dimensional text online

Andrey Kutuzov, Elizaveta Kuzmenko


Abstract
We describe a new addition to the WebVectors toolkit which is used to serve word embedding models over the Web. The new ELMoViz module adds support for contextualized embedding architectures, in particular for ELMo models. The provided visualizations follow the metaphor of ‘two-dimensional text’ by showing lexical substitutes: words which are most semantically similar in context to the words of the input sentence. The system allows the user to change the ELMo layers from which token embeddings are inferred. It also conveys corpus information about the query words and their lexical substitutes (namely their frequency tiers and parts of speech). The module is well integrated into the rest of the WebVectors toolkit, providing lexical hyperlinks to word representations in static embedding models. Two web services have already implemented the new functionality with pre-trained ELMo models for Russian, Norwegian and English.
Anthology ID:
2021.eacl-demos.18
Volume:
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations
Month:
April
Year:
2021
Address:
Online
Venue:
EACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
143–148
Language:
URL:
https://aclanthology.org/2021.eacl-demos.18
DOI:
10.18653/v1/2021.eacl-demos.18
Bibkey:
Cite (ACL):
Andrey Kutuzov and Elizaveta Kuzmenko. 2021. Representing ELMo embeddings as two-dimensional text online. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, pages 143–148, Online. Association for Computational Linguistics.
Cite (Informal):
Representing ELMo embeddings as two-dimensional text online (Kutuzov & Kuzmenko, EACL 2021)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2021.eacl-demos.18.pdf