Dbnary: Wiktionary as a LMF based Multilingual RDF network

Gilles Sérasset


Abstract
Contributive resources, such as wikipedia, have proved to be valuable in Natural Language Processing or Multilingual Information Retrieval applications.This article focusses on Wiktionary, the dictionary part of the collaborative resources sponsored by the Wikimedia foundation. In this article we present a word net that has been extracted from French, English and German wiktionaries. We present the structure of this word net and discuss the specific extraction problems induced by this kind of contributive resources and the method used to overcome them. Then we show how we represent the extracted data as a Lexical Markup Framework (LMF) compatible lexical network represented in Resource Description Framework (RDF) format.
Anthology ID:
L12-1195
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
2466–2472
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/387_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Gilles Sérasset. 2012. Dbnary: Wiktionary as a LMF based Multilingual RDF network. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 2466–2472, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Dbnary: Wiktionary as a LMF based Multilingual RDF network (Sérasset, LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/387_Paper.pdf