SynSemClass Linked Lexicon: Mapping Synonymy between Languages

Zdenka Uresova, Eva Fucikova, Eva Hajicova, Jan Hajic


Abstract
This paper reports on an extended version of a synonym verb class lexicon, newly called SynSemClass (formerly CzEngClass). This lexicon stores cross-lingual semantically similar verb senses in synonym classes extracted from a richly annotated parallel corpus, the Prague Czech-English Dependency Treebank. When building the lexicon, we make use of predicate-argument relations (valency) and link them to semantic roles; in addition, each entry is linked to several external lexicons of more or less “semantic” nature, namely FrameNet, WordNet, VerbNet, OntoNotes and PropBank, and Czech VALLEX. The aim is to provide a linguistic resource that can be used to compare semantic roles and their syntactic properties and features across languages within and across synonym groups (classes, or ’synsets’), as well as gold standard data for automatic NLP experiments with such synonyms, such as synonym discovery, feature mapping, etc. However, perhaps the most important goal is to eventually build an event type ontology that can be referenced and used as a human-readable and human-understandable “database” for all types of events, processes and states. While the current paper describes primarily the content of the lexicon, we are also presenting a preliminary design of a format compatible with Linked Data, on which we are hoping to get feedback during discussions at the workshop. Once the resource (in whichever form) is applied to corpus annotation, deep analysis will be possible using such combined resources as training data.
Anthology ID:
2020.globalex-1.2
Volume:
Proceedings of the 2020 Globalex Workshop on Linked Lexicography
Month:
May
Year:
2020
Address:
Marseille, France
Editors:
Ilan Kernerman, Simon Krek, John P. McCrae, Jorge Gracia, Sina Ahmadi, Besim Kabashi
Venue:
GLOBALEX
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
10–19
Language:
English
URL:
https://aclanthology.org/2020.globalex-1.2
DOI:
Bibkey:
Cite (ACL):
Zdenka Uresova, Eva Fucikova, Eva Hajicova, and Jan Hajic. 2020. SynSemClass Linked Lexicon: Mapping Synonymy between Languages. In Proceedings of the 2020 Globalex Workshop on Linked Lexicography, pages 10–19, Marseille, France. European Language Resources Association.
Cite (Informal):
SynSemClass Linked Lexicon: Mapping Synonymy between Languages (Uresova et al., GLOBALEX 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/naacl24-info/2020.globalex-1.2.pdf