Crossing Dialectal Boundaries: Building a Treebank for the Dialect of Lesbos through Knowledge Transfer from Standard Modern Greek
Stavros Bompolas, Stella Markantonatou, Angela Ralli, Antonios Anastasopoulos
Abstract
This paper presents the first treebank for the dialect of Lesbos, a low-resource living Northern variety of Modern Greek (MG), annotated according to the Universal Dependencies (UD) framework. So far, the only dialectal treebank available for Greek developed with cross-dialectal knowledge transfer is an East Cretan one, which belongs to the same Southern branch as Standard Modern Greek (SMG). Our study investigates the effectiveness of cross-dialectal knowledge transfer between dialectologically less similar varieties of the same language by leveraging knowledge from SMG to annotate the Northern dialect of Lesbos. We describe the annotation process, present the resulting treebank, inject additional linguistic knowledge to enhance the results, and evaluate the effectiveness of cross-dialectal knowledge transfer for active annotation. Our findings contribute to a better understanding of how dialectal variation within language families affects knowledge transfer in the UD framework, with implications for other low-resource varieties.- Anthology ID:
- 2025.udw-1.5
- Volume:
- Proceedings of the Eighth Workshop on Universal Dependencies (UDW, SyntaxFest 2025)
- Month:
- August
- Year:
- 2025
- Address:
- Ljubljana, Slovenia
- Editors:
- Gosse Bomma, Çağrı Çöltekin
- Venues:
- UDW | WS | SyntaxFest
- SIG:
- SIGPARSE
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 39–51
- Language:
- URL:
- https://preview.aclanthology.org/mtsummit-25-ingestion/2025.udw-1.5/
- DOI:
- Cite (ACL):
- Stavros Bompolas, Stella Markantonatou, Angela Ralli, and Antonios Anastasopoulos. 2025. Crossing Dialectal Boundaries: Building a Treebank for the Dialect of Lesbos through Knowledge Transfer from Standard Modern Greek. In Proceedings of the Eighth Workshop on Universal Dependencies (UDW, SyntaxFest 2025), pages 39–51, Ljubljana, Slovenia. Association for Computational Linguistics.
- Cite (Informal):
- Crossing Dialectal Boundaries: Building a Treebank for the Dialect of Lesbos through Knowledge Transfer from Standard Modern Greek (Bompolas et al., UDW-SyntaxFest 2025)
- PDF:
- https://preview.aclanthology.org/mtsummit-25-ingestion/2025.udw-1.5.pdf