Parallel Universal Dependencies Treebanks for Turkic Languages
Arofat Akhundjanova, Furkan Akkurt, Bermet Chontaeva, Soudabeh Eslami, Cagri Coltekin
Abstract
We introduce the first fully aligned and manually annotated parallel Universal Dependencies (UD) treebanks for four Turkic languages: Azerbaijani, Kyrgyz, Turkish, and Uzbek. These resources currently consist of 148 strategically selected sentences that illustrate typologically significant morphosyntactic phenomena across these related yet distinct languages. These parallel treebanks enable systematic comparative studies of Turkic syntax and may be instrumental in cross-lingual NLP applications. All treebanks are available as part of UD v2.16.- Anthology ID:
- 2025.udw-1.14
- Volume:
- Proceedings of the Eighth Workshop on Universal Dependencies (UDW, SyntaxFest 2025)
- Month:
- August
- Year:
- 2025
- Address:
- Ljubljana, Slovenia
- Editors:
- Gosse Bomma, Çağrı Çöltekin
- Venues:
- UDW | WS | SyntaxFest
- SIG:
- SIGPARSE
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 129–136
- Language:
- URL:
- https://preview.aclanthology.org/mtsummit-25-ingestion/2025.udw-1.14/
- DOI:
- Cite (ACL):
- Arofat Akhundjanova, Furkan Akkurt, Bermet Chontaeva, Soudabeh Eslami, and Cagri Coltekin. 2025. Parallel Universal Dependencies Treebanks for Turkic Languages. In Proceedings of the Eighth Workshop on Universal Dependencies (UDW, SyntaxFest 2025), pages 129–136, Ljubljana, Slovenia. Association for Computational Linguistics.
- Cite (Informal):
- Parallel Universal Dependencies Treebanks for Turkic Languages (Akhundjanova et al., UDW-SyntaxFest 2025)
- PDF:
- https://preview.aclanthology.org/mtsummit-25-ingestion/2025.udw-1.14.pdf