Converting an HPSG-based Treebank into its Parallel Dependency-based Treebank

Masood Ghayoomi, Jonas Kuhn


Abstract
A treebank is an important language resource for supervised statistical parsers. The parser induces the grammatical properties of a language from this language resource and uses the model to parse unseen data automatically. Since developing such a resource is very time-consuming and tedious, one can take advantage of already extant resources by adapting them to a particular application. This reduces the amount of human effort required to develop a new language resource. In this paper, we introduce an algorithm to convert an HPSG-based treebank into its parallel dependency-based treebank. With this converter, we can automatically create a new language resource from an existing treebank developed based on a grammar formalism. Our proposed algorithm is able to create both projective and non-projective dependency trees.
Anthology ID:
L14-1378
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
802–809
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/441_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Masood Ghayoomi and Jonas Kuhn. 2014. Converting an HPSG-based Treebank into its Parallel Dependency-based Treebank. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 802–809, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Converting an HPSG-based Treebank into its Parallel Dependency-based Treebank (Ghayoomi & Kuhn, LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/441_Paper.pdf