Guarani: A Case Study in Resource Development for Quick Ramp-Up MT
Ahmed Abdelali, James Cowie, Steve Helmreich, Wanying Jin, Maria Pilar Milagros, Bill Ogden, Mansouri Rad, Ron Zacharski
Abstract
In this paper we describe a set of processes for the acquisition of resources for quick rampup machine translation (MT) from any language lacking significant machine tractable resources into English, using the Paraguayan indigenous language Guarani as well as Amharic and Chechen, as examples. Our task is to develop a 250,000 monolingual corpus, a 250,000 bilingual parallel corpus, and smaller corpora tagged with part of speech, named entity, and morphological annotations.- Anthology ID:
- 2006.amta-papers.1
- Volume:
- Proceedings of the 7th Conference of the Association for Machine Translation in the Americas: Technical Papers
- Month:
- August 8-12
- Year:
- 2006
- Address:
- Cambridge, Massachusetts, USA
- Venue:
- AMTA
- SIG:
- Publisher:
- Association for Machine Translation in the Americas
- Note:
- Pages:
- 1–9
- Language:
- URL:
- https://aclanthology.org/2006.amta-papers.1
- DOI:
- Cite (ACL):
- Ahmed Abdelali, James Cowie, Steve Helmreich, Wanying Jin, Maria Pilar Milagros, Bill Ogden, Mansouri Rad, and Ron Zacharski. 2006. Guarani: A Case Study in Resource Development for Quick Ramp-Up MT. In Proceedings of the 7th Conference of the Association for Machine Translation in the Americas: Technical Papers, pages 1–9, Cambridge, Massachusetts, USA. Association for Machine Translation in the Americas.
- Cite (Informal):
- Guarani: A Case Study in Resource Development for Quick Ramp-Up MT (Abdelali et al., AMTA 2006)
- PDF:
- https://preview.aclanthology.org/ingest-2024-clasp/2006.amta-papers.1.pdf