Semi-Supervised Neural System for Tagging, Parsing and Lematization

Piotr Rybak, Alina Wróblewska


Abstract
This paper describes the ICS PAS system which took part in CoNLL 2018 shared task on Multilingual Parsing from Raw Text to Universal Dependencies. The system consists of jointly trained tagger, lemmatizer, and dependency parser which are based on features extracted by a biLSTM network. The system uses both fully connected and dilated convolutional neural architectures. The novelty of our approach is the use of an additional loss function, which reduces the number of cycles in the predicted dependency graphs, and the use of self-training to increase the system performance. The proposed system, i.e. ICS PAS (Warszawa), ranked 3th/4th in the official evaluation obtaining the following overall results: 73.02 (LAS), 60.25 (MLAS) and 64.44 (BLEX).
Anthology ID:
K18-2004
Volume:
Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies
Month:
October
Year:
2018
Address:
Brussels, Belgium
Editors:
Daniel Zeman, Jan Hajič
Venue:
CoNLL
SIG:
SIGNLL
Publisher:
Association for Computational Linguistics
Note:
Pages:
45–54
Language:
URL:
https://aclanthology.org/K18-2004
DOI:
10.18653/v1/K18-2004
Bibkey:
Cite (ACL):
Piotr Rybak and Alina Wróblewska. 2018. Semi-Supervised Neural System for Tagging, Parsing and Lematization. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pages 45–54, Brussels, Belgium. Association for Computational Linguistics.
Cite (Informal):
Semi-Supervised Neural System for Tagging, Parsing and Lematization (Rybak & Wróblewska, CoNLL 2018)
Copy Citation:
PDF:
https://preview.aclanthology.org/fix-dup-bibkey/K18-2004.pdf
Code
 360er0/COMBO
Data
Universal Dependencies