WordNet—Wikipedia—Wiktionary: Construction of a Three-way Alignment

Tristan Miller, Iryna Gurevych


Abstract
The coverage and quality of conceptual information contained in lexical semantic resources is crucial for many tasks in natural language processing. Automatic alignment of complementary resources is one way of improving this coverage and quality; however, past attempts have always been between pairs of specific resources. In this paper we establish some set-theoretic conventions for describing concepts and their alignments, and use them to describe a method for automatically constructing n-way alignments from arbitrary pairwise alignments. We apply this technique to the production of a three-way alignment from previously published WordNet-Wikipedia and WordNet-Wiktionary alignments. We then present a quantitative and informal qualitative analysis of the aligned resource. The three-way alignment was found to have greater coverage, an enriched sense representation, and coarser sense granularity than both the original resources and their pairwise alignments, though this came at the cost of accuracy. An evaluation of the induced word sense clusters in a word sense disambiguation task showed that they were no better than random clusters of equivalent granularity. However, use of the alignments to enrich a sense inventory with additional sense glosses did significantly improve the performance of a baseline knowledge-based WSD algorithm.
Anthology ID:
L14-1345
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
2094–2100
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/4_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Tristan Miller and Iryna Gurevych. 2014. WordNet—Wikipedia—Wiktionary: Construction of a Three-way Alignment. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 2094–2100, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
WordNet—Wikipedia—Wiktionary: Construction of a Three-way Alignment (Miller & Gurevych, LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/4_Paper.pdf