Compiling large language resources using lexical similarity metrics for domain taxonomy learning

Ronny Melz, Pum-Mo Ryu, Key-Sun Choi


Abstract
In this contribution we present a new methodology for compiling large language resources for domain-specific taxonomy learning. We describe the stages needed to handle the rich morphology of an agglutinative language, namely Korean, and present a second-order machine learning algorithm that uncovers term similarity from a given raw text corpus. The language resource compilation described here is part of a fully automatic top-down approach to taxonomy construction that does not involve the human effort usually required.
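As an illustration only, the idea of second-order term similarity can be sketched as follows. The abstract does not specify the algorithm, so this sketch assumes one common reading: two terms count as similar when they co-occur with similar terms, even if they never co-occur with each other. The function names, the sentence-level co-occurrence window, the cosine measure, and the toy English corpus are all assumptions made for this example and are not taken from the paper. For Korean, the tokens would be the output of a morphological analysis step, which is not shown here.

```python
from collections import Counter, defaultdict
from math import sqrt


def cooccurrence_vectors(sentences):
    """First-order statistics: for each term, count the other terms
    that appear in the same sentence (assumed co-occurrence window)."""
    vectors = defaultdict(Counter)
    for tokens in sentences:
        unique = set(tokens)
        for term in unique:
            for other in unique:
                if other != term:
                    vectors[term][other] += 1
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm_u = sqrt(sum(x * x for x in u.values()))
    norm_v = sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def second_order_similarity(vectors, a, b):
    """Second-order similarity: compare the co-occurrence profiles
    of two terms rather than their direct co-occurrence."""
    return cosine(vectors[a], vectors[b])


# Hypothetical toy corpus, already tokenized.
sentences = [
    ["cpu", "executes", "instructions"],
    ["processor", "executes", "instructions"],
    ["keyboard", "sends", "input"],
]
vectors = cooccurrence_vectors(sentences)
similar = second_order_similarity(vectors, "cpu", "processor")
unrelated = second_order_similarity(vectors, "cpu", "keyboard")
```

In this toy corpus "cpu" and "processor" never share a sentence, yet their co-occurrence profiles are identical, so `similar` is close to 1 while `unrelated` is 0. Term pairs scored this way could then feed a taxonomy learning step.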
Anthology ID:
L06-1266
Volume:
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)
Month:
May
Year:
2006
Address:
Genoa, Italy
Editors:
Nicoletta Calzolari, Khalid Choukri, Aldo Gangemi, Bente Maegaard, Joseph Mariani, Jan Odijk, Daniel Tapias
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
URL:
http://www.lrec-conf.org/proceedings/lrec2006/pdf/446_pdf.pdf
Cite (ACL):
Ronny Melz, Pum-Mo Ryu, and Key-Sun Choi. 2006. Compiling large language resources using lexical similarity metrics for domain taxonomy learning. In Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06), Genoa, Italy. European Language Resources Association (ELRA).
Cite (Informal):
Compiling large language resources using lexical similarity metrics for domain taxonomy learning (Melz et al., LREC 2006)