CUNI x-ling: Parsing Under-Resourced Languages in CoNLL 2018 UD Shared Task

Rudolf Rosa, David Mareček


Abstract
This is a system description paper for the CUNI x-ling submission to the CoNLL 2018 UD Shared Task. We focused on parsing under-resourced languages, with no or little training data available. We employed a wide range of approaches, including simple word-based treebank translation, combination of delexicalized parsers, and exploitation of available morphological dictionaries, with a dedicated setup tailored to each of the languages. In the official evaluation, our submission was identified as the clear winner of the Low-resource languages category.
Anthology ID:
K18-2019
Volume:
Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies
Month:
October
Year:
2018
Address:
Brussels, Belgium
Editors:
Daniel Zeman, Jan Hajič
Venue:
CoNLL
SIG:
SIGNLL
Publisher:
Association for Computational Linguistics
Note:
Pages:
187–196
Language:
URL:
https://aclanthology.org/K18-2019
DOI:
10.18653/v1/K18-2019
Bibkey:
Cite (ACL):
Rudolf Rosa and David Mareček. 2018. CUNI x-ling: Parsing Under-Resourced Languages in CoNLL 2018 UD Shared Task. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pages 187–196, Brussels, Belgium. Association for Computational Linguistics.
Cite (Informal):
CUNI x-ling: Parsing Under-Resourced Languages in CoNLL 2018 UD Shared Task (Rosa & Mareček, CoNLL 2018)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-2/K18-2019.pdf
Data
OpenSubtitlesUniversal Dependencies