Design and implementation of controlled elicitation for machine translation of low-density languages

Katharina Probst, Ralf Brown, Jaime Carbonell, Alon Lavie, Lori Levin, Erik Peterson


Abstract
NICE is a machine translation project for low-density languages. We are building a tool that will elicit a controlled corpus from a bilingual speaker who is not an expert in linguistics. The corpus is intended to cover major typological phenomena, as it is designed to work for any language. Using implicational universals, we strive to minimize the number of sentences that each informant has to translate. From the elicited sentences, we learn transfer rules with a version space algorithm. Our vision for MT in the future is one in which systems can be quickly trained for new languages by native speakers, so that speakers of minor languages can participate in education, health care, government, and internet without having to give up their languages.
Anthology ID:
2001.mtsummit-road.7
Volume:
Workshop on MT2010: Towards a Road Map for MT
Month:
September 18-22
Year:
2001
Address:
Santiago de Compostela, Spain
Venue:
MTSummit
SIG:
Publisher:
Note:
Pages:
Language:
URL:
https://aclanthology.org/2001.mtsummit-road.7
DOI:
Bibkey:
Cite (ACL):
Katharina Probst, Ralf Brown, Jaime Carbonell, Alon Lavie, Lori Levin, and Erik Peterson. 2001. Design and implementation of controlled elicitation for machine translation of low-density languages. In Workshop on MT2010: Towards a Road Map for MT, Santiago de Compostela, Spain.
Cite (Informal):
Design and implementation of controlled elicitation for machine translation of low-density languages (Probst et al., MTSummit 2001)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2001.mtsummit-road.7.pdf