Universal Dependencies and Morphology for Hungarian - and on the Price of Universality
Veronika Vincze, Katalin Simkó, Zsolt Szántó, Richárd Farkas
Abstract
In this paper, we present how the principles of universal dependencies and morphology have been adapted to Hungarian. We report the most challenging grammatical phenomena and our solutions to them. On the basis of the adapted guidelines, we have converted and manually corrected 1,800 sentences from the Szeged Treebank to universal dependency format. We also introduce experiments on this manually annotated corpus for evaluating automatic conversion and the added value of language-specific, i.e. non-universal, annotations. Our results reveal that converting to universal dependencies is not necessarily trivial; moreover, using language-specific morphological features may have an impact on overall performance.
- Anthology ID:
- E17-1034
- Volume:
- Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers
- Month:
- April
- Year:
- 2017
- Address:
- Valencia, Spain
- Venue:
- EACL
- Publisher:
- Association for Computational Linguistics
- Pages:
- 356–365
- URL:
- https://aclanthology.org/E17-1034
- Cite (ACL):
- Veronika Vincze, Katalin Simkó, Zsolt Szántó, and Richárd Farkas. 2017. Universal Dependencies and Morphology for Hungarian - and on the Price of Universality. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers, pages 356–365, Valencia, Spain. Association for Computational Linguistics.
- Cite (Informal):
- Universal Dependencies and Morphology for Hungarian - and on the Price of Universality (Vincze et al., EACL 2017)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/E17-1034.pdf
- Data
- Universal Dependencies