Abstract
Various lexical resources are being published in RDF. To enhance the usability of these resources, identical resources in different data sets should be linked. If lexical resources are described in different natural languages, then techniques to deal with multilinguality are required for interlinking. In this paper, we evaluate machine translation for interlinking concepts, i.e., generic entities named with a common noun or term. In our previous work, the evaluated method has been applied on named entities. We conduct two experiments involving different thesauri in different languages. The first experiment involves concepts from the TheSoz multilingual thesaurus in three languages: English, French and German. The second experiment involves concepts from the EuroVoc and AGROVOC thesauri in English and Chinese respectively. Our results demonstrate that machine translation can be beneficial for cross-lingual thesauri interlinking independently of a dataset structure.- Anthology ID:
- L16-1387
- Volume:
- Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
- Month:
- May
- Year:
- 2016
- Address:
- Portorož, Slovenia
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- 2442–2449
- Language:
- URL:
- https://aclanthology.org/L16-1387
- DOI:
- Cite (ACL):
- Tatiana Lesnikova, Jérôme David, and Jérôme Euzenat. 2016. Cross-lingual RDF Thesauri Interlinking. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 2442–2449, Portorož, Slovenia. European Language Resources Association (ELRA).
- Cite (Informal):
- Cross-lingual RDF Thesauri Interlinking (Lesnikova et al., LREC 2016)
- PDF:
- https://preview.aclanthology.org/nodalida-main-page/L16-1387.pdf