Statistical Machine Translation of English-Manipuri using Morpho-syntactic and Semantic Information

Thoudam Doren Singh, Savaji Bandyopadhyay


Abstract
English-Manipuri language pair is one of the rarely investigated with restricted bilingual resources. The development of a factored Statistical Machine Translation (SMT) system between English as source and Manipuri, a morphologically rich language as target is reported. The role of the suffixes and dependency relations on the source side and case markers on the target side are identified as important translation factors. The morphology and dependency relations play important roles to improve the translation quality. A parallel corpus of 10350 sentences from news domain is used for training and the system is tested with 500 sentences. Using the proposed translation factors, the output of the translation quality is improved as indicated by the BLEU score and subjective evaluation.
Anthology ID:
2010.amta-srw.1
Volume:
Proceedings of the 9th Conference of the Association for Machine Translation in the Americas: Student Research Workshop
Month:
October 31-November 4
Year:
2010
Address:
Denver, Colorado, USA
Venue:
AMTA
SIG:
Publisher:
Association for Machine Translation in the Americas
Note:
Pages:
Language:
URL:
https://aclanthology.org/2010.amta-srw.1
DOI:
Bibkey:
Cite (ACL):
Thoudam Doren Singh and Savaji Bandyopadhyay. 2010. Statistical Machine Translation of English-Manipuri using Morpho-syntactic and Semantic Information. In Proceedings of the 9th Conference of the Association for Machine Translation in the Americas: Student Research Workshop, Denver, Colorado, USA. Association for Machine Translation in the Americas.
Cite (Informal):
Statistical Machine Translation of English-Manipuri using Morpho-syntactic and Semantic Information (Singh & Bandyopadhyay, AMTA 2010)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2010.amta-srw.1.pdf