Abstract
This paper reports on a number of experiments in which we applied standard techniques from NLP in the context of documentation of endangered languages. We concentrated on the use of existing, freely available toolkits. Specifically, we explore the use of Finite-State Morphological Analysis, Maximum Entropy Part-of-Speech Tagging, and N-Gram Language Modeling.- Anthology ID:
- L04-1240
- Volume:
- Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)
- Month:
- May
- Year:
- 2004
- Address:
- Lisbon, Portugal
- Editors:
- Maria Teresa Lino, Maria Francisca Xavier, Fátima Ferreira, Rute Costa, Raquel Silva
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- Language:
- URL:
- http://www.lrec-conf.org/proceedings/lrec2004/pdf/412.pdf
- DOI:
- Cite (ACL):
- Jonas Kuhn and B’alam Mateo-Toledo. 2004. Applying Computational Linguistic Techniques in a Documentary Project for Q’anjob’al (Mayan, Guatemala). In Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04), Lisbon, Portugal. European Language Resources Association (ELRA).
- Cite (Informal):
- Applying Computational Linguistic Techniques in a Documentary Project for Q’anjob’al (Mayan, Guatemala) (Kuhn & Mateo-Toledo, LREC 2004)
- PDF:
- http://www.lrec-conf.org/proceedings/lrec2004/pdf/412.pdf