IRB-NLP at SemEval-2022 Task 1: Exploring the Relationship Between Words and Their Semantic Representations

Damir Korenčić, Ivan Grubisic


Abstract
What is the relation between a word and its description, or a word and its embedding? Both descriptions and embeddings are semantic representations of words. But, what information from the original word remains in these representations? Or more importantly, which information about a word do these two representations share? Definition Modeling and Reverse Dictionary are two opposite learning tasks that address these questions. The goal of the Definition Modeling task is to investigate the power of information laying inside a word embedding to express the meaning of the word in a humanly understandable way – as a dictionary definition. Conversely, the Reverse Dictionary task explores the ability to predict word embeddings directly from its definition. In this paper, by tackling these two tasks, we are exploring the relationship between words and their semantic representations. We present our findings based on the descriptive, exploratory, and predictive data analysis conducted on the CODWOE dataset. We give a detailed overview of the systems that we designed for Definition Modeling and Reverse Dictionary tasks, and that achieved top scores on SemEval-2022 CODWOE challenge in several subtasks. We hope that our experimental results concerning the predictive models and the data analyses we provide will prove useful in future explorations of word representations and their relationships.
Anthology ID:
2022.semeval-1.5
Volume:
Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)
Month:
July
Year:
2022
Address:
Seattle, United States
Venue:
SemEval
SIGs:
SIGLEX | SIGSEM
Publisher:
Association for Computational Linguistics
Note:
Pages:
36–59
Language:
URL:
https://aclanthology.org/2022.semeval-1.5
DOI:
10.18653/v1/2022.semeval-1.5
Bibkey:
Cite (ACL):
Damir Korenčić and Ivan Grubisic. 2022. IRB-NLP at SemEval-2022 Task 1: Exploring the Relationship Between Words and Their Semantic Representations. In Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022), pages 36–59, Seattle, United States. Association for Computational Linguistics.
Cite (Informal):
IRB-NLP at SemEval-2022 Task 1: Exploring the Relationship Between Words and Their Semantic Representations (Korenčić & Grubisic, SemEval 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/auto-file-uploads/2022.semeval-1.5.pdf
Video:
 https://preview.aclanthology.org/auto-file-uploads/2022.semeval-1.5.mp4