Mariana Illescas
2022
Building an Endangered Language Resource in the Classroom: Universal Dependencies for Kakataibo
Roberto Zariquiey
|
Claudia Alvarado
|
Ximena Echevarría
|
Luisa Gomez
|
Rosa Gonzales
|
Mariana Illescas
|
Sabina Oporto
|
Frederic Blum
|
Arturo Oncevay
|
Javier Vera
Proceedings of the Thirteenth Language Resources and Evaluation Conference
In this paper, we launch a new Universal Dependencies treebank for an endangered language from Amazonia: Kakataibo, a Panoan language spoken in Peru. We first discuss the collaborative methodology implemented, which proved effective to create a treebank in the context of a Computational Linguistic course for undergraduates. Then, we describe the general details of the treebank and the language-specific considerations implemented for the proposed annotation. We finally conduct some experiments on part-of-speech tagging and syntactic dependency parsing. We focus on monolingual and transfer learning settings, where we study the impact of a Shipibo-Konibo treebank, another Panoan language resource.
Search
Co-authors
- Roberto Zariquiey 1
- Claudia Alvarado 1
- Ximena Echevarría 1
- Luisa Gomez 1
- Rosa Gonzales 1
- show all...
Venues
- lrec1