Projecting Multiword Expression Resources on a Polish Treebank

Agata Savary, Jakub Waszczuk


Abstract
Multiword expressions (MWEs) are linguistic objects containing two or more words and showing idiosyncratic behavior at different levels. Treebanks with annotated MWEs enable studies of such properties, as well as training and evaluation of MWE-aware parsers. However, few treebanks contain full-fledged MWE annotations. We show how this gap can be bridged in Polish by projecting 3 MWE resources on a constituency treebank.
Anthology ID:
W17-1404
Volume:
Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing
Month:
April
Year:
2017
Address:
Valencia, Spain
Editors:
Tomaž Erjavec, Jakub Piskorski, Lidia Pivovarova, Jan Šnajder, Josef Steinberger, Roman Yangarber
Venue:
BSNLP
SIG:
SIGSLAV
Publisher:
Association for Computational Linguistics
Note:
Pages:
20–26
Language:
URL:
https://aclanthology.org/W17-1404
DOI:
10.18653/v1/W17-1404
Bibkey:
Cite (ACL):
Agata Savary and Jakub Waszczuk. 2017. Projecting Multiword Expression Resources on a Polish Treebank. In Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing, pages 20–26, Valencia, Spain. Association for Computational Linguistics.
Cite (Informal):
Projecting Multiword Expression Resources on a Polish Treebank (Savary & Waszczuk, BSNLP 2017)
Copy Citation:
PDF:
https://preview.aclanthology.org/emnlp-22-attachments/W17-1404.pdf