Universal Dependencies for Serbian in Comparison with Croatian and Other Slavic Languages
Tanja Samardžić, Mirjana Starović, Željko Agić, Nikola Ljubešić
Abstract
The paper documents the procedure of building a new Universal Dependencies (UDv2) treebank for Serbian starting from an existing Croatian UDv1 treebank and taking into account the other Slavic UD annotation guidelines. We describe the automatic and manual annotation procedures, discuss the annotation of Slavic-specific categories (case governing quantifiers, reflexive pronouns, question particles) and propose an approach to handling deverbal nouns in Slavic languages.- Anthology ID:
- W17-1407
- Volume:
- Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing
- Month:
- April
- Year:
- 2017
- Address:
- Valencia, Spain
- Editors:
- Tomaž Erjavec, Jakub Piskorski, Lidia Pivovarova, Jan Šnajder, Josef Steinberger, Roman Yangarber
- Venue:
- BSNLP
- SIG:
- SIGSLAV
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 39–44
- Language:
- URL:
- https://aclanthology.org/W17-1407
- DOI:
- 10.18653/v1/W17-1407
- Cite (ACL):
- Tanja Samardžić, Mirjana Starović, Željko Agić, and Nikola Ljubešić. 2017. Universal Dependencies for Serbian in Comparison with Croatian and Other Slavic Languages. In Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing, pages 39–44, Valencia, Spain. Association for Computational Linguistics.
- Cite (Informal):
- Universal Dependencies for Serbian in Comparison with Croatian and Other Slavic Languages (Samardžić et al., BSNLP 2017)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-2/W17-1407.pdf
- Data
- Universal Dependencies