Abstract
We describe the SEx BiST parser (Semantically EXtended Bi-LSTM parser) developed at Lattice for the CoNLL 2018 Shared Task (Multilingual Parsing from Raw Text to Universal Dependencies). The main characteristic of our work is the encoding of three different modes of contextual information for parsing: (i) Treebank feature representations, (ii) Multilingual word representations, (iii) ELMo representations obtained via unsupervised learning from external resources. Our parser performed well in the official end-to-end evaluation (73.02 LAS – 4th/26 teams, and 78.72 UAS – 2nd/26); remarkably, we achieved the best UAS scores on all the English corpora by applying the three suggested feature representations. Finally, we were also ranked 1st at the optional event extraction task, part of the 2018 Extrinsic Parser Evaluation campaign.
- Anthology ID:
- K18-2014
- Volume:
- Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies
- Month:
- October
- Year:
- 2018
- Address:
- Brussels, Belgium
- Editors:
- Daniel Zeman, Jan Hajič
- Venue:
- CoNLL
- SIG:
- SIGNLL
- Publisher:
- Association for Computational Linguistics
- Pages:
- 143–152
- URL:
- https://aclanthology.org/K18-2014
- DOI:
- 10.18653/v1/K18-2014
- Cite (ACL):
- KyungTae Lim, Cheoneum Park, Changki Lee, and Thierry Poibeau. 2018. SEx BiST: A Multi-Source Trainable Parser with Deep Contextualized Lexical Representations. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pages 143–152, Brussels, Belgium. Association for Computational Linguistics.
- Cite (Informal):
- SEx BiST: A Multi-Source Trainable Parser with Deep Contextualized Lexical Representations (Lim et al., CoNLL 2018)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-2/K18-2014.pdf
- Code:
- jujbob/multilingual-models