Abstract
In this research, we evaluate different approaches for the automatic extraction of hypernym relations from English and Dutch technical text. The detected hypernym relations should enable us to semantically structure automatically obtained term lists from domain- and user-specific data. We investigated three hypernym extraction approaches for Dutch and English: a lexico-syntactic pattern-based approach, a distributional model, and a morpho-syntactic method. To test the performance of these approaches on domain-specific data, we collected and manually annotated English and Dutch data from two technical domains, namely the dredging and financial domains. The experimental results show that the morpho-syntactic approach in particular obtains good results for automatic hypernym extraction from technical and domain-specific texts.
- Anthology ID:
- L14-1365
- Volume:
- Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
- Month:
- May
- Year:
- 2014
- Address:
- Reykjavik, Iceland
- Editors:
- Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
- Venue:
- LREC
- Publisher:
- European Language Resources Association (ELRA)
- Pages:
- 490–497
- URL:
- http://www.lrec-conf.org/proceedings/lrec2014/pdf/426_Paper.pdf
- Cite (ACL):
- Els Lefever, Marjan Van de Kauter, and Véronique Hoste. 2014. Evaluation of Automatic Hypernym Extraction from Technical Corpora in English and Dutch. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 490–497, Reykjavik, Iceland. European Language Resources Association (ELRA).
- Cite (Informal):
- Evaluation of Automatic Hypernym Extraction from Technical Corpora in English and Dutch (Lefever et al., LREC 2014)