UD Treebanks for Esperanto as a natural language

Masanori Oya


Abstract
This paper describes the details of UD-based morphological and syntactic annotations on Esperanto texts to construct its small-scale UD treebank. Though it was created as an international auxiliary language, Esperanto has increasingly been studied as a natural language both in linguistics and in NLP. This paper introduces the detail of manual annotation of UD morphological and relational tags and describes how the frequencies of these tags differ across the treebanks and discusses the possibility of future research of Esperanto as a natural language.
Anthology ID:
2025.udw-1.3
Volume:
Proceedings of the Eighth Workshop on Universal Dependencies (UDW, SyntaxFest 2025)
Month:
August
Year:
2025
Address:
Ljubljana, Slovenia
Editors:
Gosse Bomma, Çağrı Çöltekin
Venues:
UDW | WS | SyntaxFest
SIG:
SIGPARSE
Publisher:
Association for Computational Linguistics
Note:
Pages:
22–29
Language:
URL:
https://preview.aclanthology.org/mtsummit-25-ingestion/2025.udw-1.3/
DOI:
Bibkey:
Cite (ACL):
Masanori Oya. 2025. UD Treebanks for Esperanto as a natural language. In Proceedings of the Eighth Workshop on Universal Dependencies (UDW, SyntaxFest 2025), pages 22–29, Ljubljana, Slovenia. Association for Computational Linguistics.
Cite (Informal):
UD Treebanks for Esperanto as a natural language (Oya, UDW-SyntaxFest 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/mtsummit-25-ingestion/2025.udw-1.3.pdf