Abstract
This is a system description paper for the CUNI x-ling submission to the CoNLL 2018 UD Shared Task. We focused on parsing under-resourced languages, with no or little training data available. We employed a wide range of approaches, including simple word-based treebank translation, combination of delexicalized parsers, and exploitation of available morphological dictionaries, with a dedicated setup tailored to each of the languages. In the official evaluation, our submission was identified as the clear winner of the Low-resource languages category.
- Anthology ID:
- K18-2019
- Volume:
- Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies
- Month:
- October
- Year:
- 2018
- Address:
- Brussels, Belgium
- Venue:
- CoNLL
- SIG:
- SIGNLL
- Publisher:
- Association for Computational Linguistics
- Pages:
- 187–196
- URL:
- https://aclanthology.org/K18-2019
- DOI:
- 10.18653/v1/K18-2019
- Cite (ACL):
- Rudolf Rosa and David Mareček. 2018. CUNI x-ling: Parsing Under-Resourced Languages in CoNLL 2018 UD Shared Task. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pages 187–196, Brussels, Belgium. Association for Computational Linguistics.
- Cite (Informal):
- CUNI x-ling: Parsing Under-Resourced Languages in CoNLL 2018 UD Shared Task (Rosa & Mareček, CoNLL 2018)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/K18-2019.pdf
- Data
- OpenSubtitles, Universal Dependencies