Cross-lingual RDF Thesauri Interlinking

Tatiana Lesnikova, Jérôme David, Jérôme Euzenat


Abstract
Various lexical resources are being published in RDF. To enhance the usability of these resources, identical resources in different data sets should be linked. If lexical resources are described in different natural languages, then techniques to deal with multilinguality are required for interlinking. In this paper, we evaluate machine translation for interlinking concepts, i.e., generic entities named with a common noun or term. In our previous work, the evaluated method has been applied on named entities. We conduct two experiments involving different thesauri in different languages. The first experiment involves concepts from the TheSoz multilingual thesaurus in three languages: English, French and German. The second experiment involves concepts from the EuroVoc and AGROVOC thesauri in English and Chinese respectively. Our results demonstrate that machine translation can be beneficial for cross-lingual thesauri interlinking independently of a dataset structure.
Anthology ID:
L16-1387
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
2442–2449
Language:
URL:
https://aclanthology.org/L16-1387
DOI:
Bibkey:
Cite (ACL):
Tatiana Lesnikova, Jérôme David, and Jérôme Euzenat. 2016. Cross-lingual RDF Thesauri Interlinking. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 2442–2449, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Cross-lingual RDF Thesauri Interlinking (Lesnikova et al., LREC 2016)
Copy Citation:
PDF:
https://preview.aclanthology.org/nodalida-main-page/L16-1387.pdf