SemEval-2017 Task 2: Multilingual and Cross-lingual Semantic Word Similarity

Jose Camacho-Collados, Mohammad Taher Pilehvar, Nigel Collier, Roberto Navigli


Abstract
This paper introduces a new task on Multilingual and Cross-lingual SemanticThis paper introduces a new task on Multilingual and Cross-lingual Semantic Word Similarity which measures the semantic similarity of word pairs within and across five languages: English, Farsi, German, Italian and Spanish. High quality datasets were manually curated for the five languages with high inter-annotator agreements (consistently in the 0.9 ballpark). These were used for semi-automatic construction of ten cross-lingual datasets. 17 teams participated in the task, submitting 24 systems in subtask 1 and 14 systems in subtask 2. Results show that systems that combine statistical knowledge from text corpora, in the form of word embeddings, and external knowledge from lexical resources are best performers in both subtasks. More information can be found on the task website: http://alt.qcri.org/semeval2017/task2/
Anthology ID:
S17-2002
Volume:
Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017)
Month:
August
Year:
2017
Address:
Vancouver, Canada
Venue:
SemEval
SIGs:
SIGLEX | SIGSEM
Publisher:
Association for Computational Linguistics
Note:
Pages:
15–26
Language:
URL:
https://aclanthology.org/S17-2002
DOI:
10.18653/v1/S17-2002
Bibkey:
Cite (ACL):
Jose Camacho-Collados, Mohammad Taher Pilehvar, Nigel Collier, and Roberto Navigli. 2017. SemEval-2017 Task 2: Multilingual and Cross-lingual Semantic Word Similarity. In Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), pages 15–26, Vancouver, Canada. Association for Computational Linguistics.
Cite (Informal):
SemEval-2017 Task 2: Multilingual and Cross-lingual Semantic Word Similarity (Camacho-Collados et al., SemEval 2017)
Copy Citation:
PDF:
https://preview.aclanthology.org/auto-file-uploads/S17-2002.pdf
Data
ConceptNet