Universal Dependencies for Serbian in Comparison with Croatian and Other Slavic Languages

Tanja Samardžić, Mirjana Starović, Željko Agić, Nikola Ljubešić


Abstract
The paper documents the procedure of building a new Universal Dependencies (UDv2) treebank for Serbian, starting from an existing Croatian UDv1 treebank and taking into account the other Slavic UD annotation guidelines. We describe the automatic and manual annotation procedures, discuss the annotation of Slavic-specific categories (case-governing quantifiers, reflexive pronouns, question particles), and propose an approach to handling deverbal nouns in Slavic languages.
Anthology ID:
W17-1407
Volume:
Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing
Month:
April
Year:
2017
Address:
Valencia, Spain
Editors:
Tomaž Erjavec, Jakub Piskorski, Lidia Pivovarova, Jan Šnajder, Josef Steinberger, Roman Yangarber
Venue:
BSNLP
SIG:
SIGSLAV
Publisher:
Association for Computational Linguistics
Pages:
39–44
URL:
https://aclanthology.org/W17-1407
DOI:
10.18653/v1/W17-1407
Cite (ACL):
Tanja Samardžić, Mirjana Starović, Željko Agić, and Nikola Ljubešić. 2017. Universal Dependencies for Serbian in Comparison with Croatian and Other Slavic Languages. In Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing, pages 39–44, Valencia, Spain. Association for Computational Linguistics.
Cite (Informal):
Universal Dependencies for Serbian in Comparison with Croatian and Other Slavic Languages (Samardžić et al., BSNLP 2017)
PDF:
https://preview.aclanthology.org/nschneid-patch-2/W17-1407.pdf
Data
Universal Dependencies