Abstract
Although contextualized word embeddings have led to great improvements in automatic language understanding, their potential for practical applications in document exploration and visualization has been little explored. Common visualization techniques used for, e.g., model analysis usually provide simple scatter plots of token-level embeddings that do not provide insight into their contextual use. In this work, we propose KeywordScape, a visual exploration tool that allows to overview, summarize, and explore the semantic content of documents based on their keywords. While existing keyword-based exploration tools assume that keywords have static meanings, our tool represents keywords in terms of their contextualized embeddings. Our application visualizes these embeddings in a semantic landscape that represents keywords as islands on a spherical map. This keeps keywords with similar context close to each other, allowing for a more precise search and comparison of documents.- Anthology ID:
- 2022.emnlp-demos.14
- Volume:
- Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: System Demonstrations
- Month:
- December
- Year:
- 2022
- Address:
- Abu Dhabi, UAE
- Venue:
- EMNLP
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 137–147
- Language:
- URL:
- https://aclanthology.org/2022.emnlp-demos.14
- DOI:
- Cite (ACL):
- Henrik Voigt, Monique Meuschke, Sina Zarrieß, and Kai Lawonn. 2022. KeywordScape: Visual Document Exploration using Contextualized Keyword Embeddings. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pages 137–147, Abu Dhabi, UAE. Association for Computational Linguistics.
- Cite (Informal):
- KeywordScape: Visual Document Exploration using Contextualized Keyword Embeddings (Voigt et al., EMNLP 2022)
- PDF:
- https://preview.aclanthology.org/nodalida-main-page/2022.emnlp-demos.14.pdf