Abstract
This paper presents an approach of automatic annotation of sentences with dependency structures. The approach builds on the idea of cross-lingual dependency projection. The presented method of acquiring dependency trees involves a weighting factor in the processes of projecting source dependency relations to target sentences and inducing well-formed target dependency trees from sets of projected dependency relations. Using a parallel corpus, source trees are transferred onto equivalent target sentences via an extended set of alignment links. Projected arcs are initially weighted according to the certainty of word alignment links. Then, arc weights are recalculated using a method based on the EM selection algorithm. Maximum spanning trees selected from EM-scored digraphs and labelled with appropriate grammatical functions constitute a target dependency treebank. Extrinsic evaluation shows that parsers trained on such a treebank may perform comparably to parsers trained on a manually developed treebank.- Anthology ID:
- L14-1446
- Volume:
- Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
- Month:
- May
- Year:
- 2014
- Address:
- Reykjavik, Iceland
- Editors:
- Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- 2306–2312
- Language:
- URL:
- http://www.lrec-conf.org/proceedings/lrec2014/pdf/538_Paper.pdf
- DOI:
- Cite (ACL):
- Alina Wróblewska and Adam Przepiórkowski. 2014. Projection-based Annotation of a Polish Dependency Treebank. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 2306–2312, Reykjavik, Iceland. European Language Resources Association (ELRA).
- Cite (Informal):
- Projection-based Annotation of a Polish Dependency Treebank (Wróblewska & Przepiórkowski, LREC 2014)
- PDF:
- http://www.lrec-conf.org/proceedings/lrec2014/pdf/538_Paper.pdf