Guarani: A Case Study in Resource Development for Quick Ramp-Up MT

Ahmed Abdelali, James Cowie, Steve Helmreich, Wanying Jin, Maria Pilar Milagros, Bill Ogden, Mansouri Rad, Ron Zacharski


Abstract
In this paper we describe a set of processes for the acquisition of re­sources for quick ramp­up machine translation (MT) from any language lacking significant machine tracta­ble resources into English, using the Paraguayan indigenous lan­guage Guarani as well as Amharic and Chechen, as examples. Our task is to develop a 250,000 mono­lingual corpus, a 250,000 bilingual parallel corpus, and smaller corpora tagged with part of speech, named entity, and morphological annota­tions.
Anthology ID:
2006.amta-papers.1
Volume:
Proceedings of the 7th Conference of the Association for Machine Translation in the Americas: Technical Papers
Month:
August 8-12
Year:
2006
Address:
Cambridge, Massachusetts, USA
Venue:
AMTA
SIG:
Publisher:
Association for Machine Translation in the Americas
Note:
Pages:
1–9
Language:
URL:
https://aclanthology.org/2006.amta-papers.1
DOI:
Bibkey:
Cite (ACL):
Ahmed Abdelali, James Cowie, Steve Helmreich, Wanying Jin, Maria Pilar Milagros, Bill Ogden, Mansouri Rad, and Ron Zacharski. 2006. Guarani: A Case Study in Resource Development for Quick Ramp-Up MT. In Proceedings of the 7th Conference of the Association for Machine Translation in the Americas: Technical Papers, pages 1–9, Cambridge, Massachusetts, USA. Association for Machine Translation in the Americas.
Cite (Informal):
Guarani: A Case Study in Resource Development for Quick Ramp-Up MT (Abdelali et al., AMTA 2006)
Copy Citation:
PDF:
https://preview.aclanthology.org/remove-xml-comments/2006.amta-papers.1.pdf