Open Text Collections as a Resource for Doing NLP with Eurasian Languages

Sebastian Nordhoff, Christian Döhler, Mandana Seyfeddinipur


Abstract
The Open Text Collections project establishes a high-quality publication channel for interlinear glossed text from endangered languages. Text collection will by made available in an open interoperable format and as a more traditional book publication. The project addresses a variety of audiences, eg. community members, typological linguists, anthropologists, NLP practitioners.
Anthology ID:
2024.eurali-1.3
Volume:
Proceedings of the 2nd Workshop on Resources and Technologies for Indigenous, Endangered and Lesser-resourced Languages in Eurasia (EURALI) @ LREC-COLING 2024
Month:
May
Year:
2024
Address:
Torino, Italia
Editors:
Atul Kr. Ojha, Sina Ahmadi, Silvie Cinková, Theodorus Fransen, Chao-Hong Liu, John P. McCrae
Venues:
EURALI | WS
SIG:
Publisher:
ELRA and ICCL
Note:
Pages:
18–23
Language:
URL:
https://aclanthology.org/2024.eurali-1.3
DOI:
Bibkey:
Cite (ACL):
Sebastian Nordhoff, Christian Döhler, and Mandana Seyfeddinipur. 2024. Open Text Collections as a Resource for Doing NLP with Eurasian Languages. In Proceedings of the 2nd Workshop on Resources and Technologies for Indigenous, Endangered and Lesser-resourced Languages in Eurasia (EURALI) @ LREC-COLING 2024, pages 18–23, Torino, Italia. ELRA and ICCL.
Cite (Informal):
Open Text Collections as a Resource for Doing NLP with Eurasian Languages (Nordhoff et al., EURALI-WS 2024)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-4/2024.eurali-1.3.pdf