Interpretable Text Embeddings and Text Similarity Explanation: A Survey

Juri Opitz; Lucas Moeller; Andrianos Michail; Sebastian Padó; Simon Clematide

Interpretable Text Embeddings and Text Similarity Explanation: A Survey

Juri Opitz, Lucas Moeller, Andrianos Michail, Sebastian Padó, Simon Clematide

Abstract

Text embeddings are a fundamental component in many NLP tasks, including classification, regression, clustering, and semantic search. However, despite their ubiquitous application, challenges persist in interpreting embeddings and explaining similarities between them.In this work, we provide a structured overview of methods specializing in inherently interpretable text embeddings and text similarity explanation, an underexplored research area. We characterize the main ideas, approaches, and trade-offs. We compare means of evaluation, discuss overarching lessons learned and finally identify opportunities and open challenges for future research.

Anthology ID:: 2025.emnlp-main.1135
Volume:: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 22314–22330
Language:
URL:: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1135/
DOI:
Bibkey:
Cite (ACL):: Juri Opitz, Lucas Moeller, Andrianos Michail, Sebastian Padó, and Simon Clematide. 2025. Interpretable Text Embeddings and Text Similarity Explanation: A Survey. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 22314–22330, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: Interpretable Text Embeddings and Text Similarity Explanation: A Survey (Opitz et al., EMNLP 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1135.pdf
Checklist:: 2025.emnlp-main.1135.checklist.pdf

PDF Cite Search Checklist Fix data