Towards a morphological transducer and orthography converter for Western Tlacolula Valley Zapotec

Jonathan Washington, Felipe Lopez, Brook Lillehaugen


Abstract
This paper presents work towards a morphological transducer and orthography converter for Dizhsa, or San Lucas Quiaviní Zapotec, an endangered Western Tlacolula Valley Zapotec language. The implementation of various aspects of the language’s morphology is presented, as well as the transducer’s ability to perform analysis in two orthographies and convert between them. Potential uses of the transducer for language maintenance and issues of licensing are also discussed. Evaluation of the transducer shows that it is fairly robust although incomplete, and evaluation of orthographic conversion shows that this method is strongly affected by the coverage of the transducer.
Anthology ID:
2021.americasnlp-1.21
Volume:
Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas
Month:
June
Year:
2021
Address:
Online
Editors:
Manuel Mager, Arturo Oncevay, Annette Rios, Ivan Vladimir Meza Ruiz, Alexis Palmer, Graham Neubig, Katharina Kann
Venue:
AmericasNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
185–193
Language:
URL:
https://aclanthology.org/2021.americasnlp-1.21
DOI:
10.18653/v1/2021.americasnlp-1.21
Bibkey:
Cite (ACL):
Jonathan Washington, Felipe Lopez, and Brook Lillehaugen. 2021. Towards a morphological transducer and orthography converter for Western Tlacolula Valley Zapotec. In Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas, pages 185–193, Online. Association for Computational Linguistics.
Cite (Informal):
Towards a morphological transducer and orthography converter for Western Tlacolula Valley Zapotec (Washington et al., AmericasNLP 2021)
Copy Citation:
PDF:
https://preview.aclanthology.org/dois-2013-emnlp/2021.americasnlp-1.21.pdf
Code
 apertium/apertium-zab