Towards a richer wordnet representation of properties

Sanni Nimb, Bolette Sandford Pedersen


Abstract
This paper discusses how information on properties in a currently developed Danish thesaurus can be transferred to the Danish wordnet, DanNet, and in this way enrich the wordnet with the highly relevant links between properties and their external arguments (i.e. tasty ― food). In spite of the fact that the thesaurus is still under development (two thirds still to be compiled) we perform an automatic transfer of relations from the thesaurus to the wordnet which shows promising results. In all, 2,362 property relations are automatically transferred to DanNet and 2% of the transferred material is manually validated. The pilot validation indicates that approx. 90 % of the transferred relations are correctly assigned whereas around 10% are either erroneous or just not very informative, a fact which, however, can partly be explained by the incompleteness of the material at its current stage. As a further consequence, the experiment has led to a richer specification of the editor guidelines to be used in the last compilation phase of the thesaurus.
Anthology ID:
L12-1097
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
3452–3456
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/247_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Sanni Nimb and Bolette Sandford Pedersen. 2012. Towards a richer wordnet representation of properties. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 3452–3456, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Towards a richer wordnet representation of properties (Nimb & Pedersen, LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/247_Paper.pdf