Javier Santillan
2025
Py-Elotl: A Python NLP package for the languages of Mexico
Ximena Gutierrez-Vasques | Robert Pugh | Victor Mijangos | Diego Barriga Martínez | Paul Aguilar | Mikel Segura | Paola Innes | Javier Santillan | Cynthia Montaño | Francis Tyers
Proceedings of the Fifth Workshop on NLP for Indigenous Languages of the Americas (AmericasNLP)
This work presents Py-Elotl, a suite of Python tools and resources for processing text in several indigenous languages spoken in Mexico. These resources include parallel corpora, linguistic taggers/analyzers, and orthographic normalization tools. The aim is to develop essential resources that support language pre-processing and linguistic research, as well as the future creation of more complete downstream applications that could be useful to speakers and enhance the visibility of these languages. The current version supports language groups such as Nahuatl, Otomi, Mixtec, and Huave. The project is open-source and freely available for use and collaboration.