Abstract
This paper describes the development and evaluation of a FST-based analyser-generator for Mapudüngun language, which is publicly available through a web interface. As far as we know, it is the first system of this kind for Mapudüngun. Following the Mapuche grammar by Smeets, we have developed a machine including the morphological and phonological aspects of Mapudüngun. Through this computational approach we have produced a finite state morphological analyser-generator capable of classifying and appropriately tagging all the components (roots and suffixes) interacting in a Mapuche word-form. A double evaluation has been carried out showing a good level of reliability. In order to face the lack of standardization of the language, additional components (an enhanced analyser, a spelling unifier and a root guesser) have been integrated in the tool. The generated corpora, the lexicons and the FST grammars are available for further development and comparison results.- Anthology ID:
- 2022.lrec-1.702
- Volume:
- Proceedings of the Thirteenth Language Resources and Evaluation Conference
- Month:
- June
- Year:
- 2022
- Address:
- Marseille, France
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association
- Note:
- Pages:
- 6540–6547
- Language:
- URL:
- https://aclanthology.org/2022.lrec-1.702
- DOI:
- Cite (ACL):
- Andrés Chandía. 2022. A Mapudüngun FST Morphological Analyser and its Web Interface. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 6540–6547, Marseille, France. European Language Resources Association.
- Cite (Informal):
- A Mapudüngun FST Morphological Analyser and its Web Interface (Chandía, LREC 2022)
- PDF:
- https://preview.aclanthology.org/remove-xml-comments/2022.lrec-1.702.pdf