Abstract
In this paper, we propose a simple methodology for building or extending wordnets using easily extractible lexical knowledge from Wiktionary and Wikipedia. This method relies on a large multilingual translation/synonym graph in many languages as well as synset-aligned wordnets. It guesses frequent and polysemous literals that are difficult to find using other methods by looking at back-translations in the graph, showing that the use of a heavily multilingual lexicon can be a way to mitigate the lack of wide coverage bilingual lexicon for wordnet creation or extension. We evaluate our approach on French by applying it for extending WOLF, a freely available French wordnet.- Anthology ID:
- L12-1669
- Volume:
- Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
- Month:
- May
- Year:
- 2012
- Address:
- Istanbul, Turkey
- Editors:
- Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- 3473–3478
- Language:
- URL:
- http://www.lrec-conf.org/proceedings/lrec2012/pdf/1131_Paper.pdf
- DOI:
- Cite (ACL):
- Valérie Hanoka and Benoît Sagot. 2012. Wordnet extension made simple: A multilingual lexicon-based approach using wiki resources. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 3473–3478, Istanbul, Turkey. European Language Resources Association (ELRA).
- Cite (Informal):
- Wordnet extension made simple: A multilingual lexicon-based approach using wiki resources (Hanoka & Sagot, LREC 2012)
- PDF:
- http://www.lrec-conf.org/proceedings/lrec2012/pdf/1131_Paper.pdf