Statistical machine translation using large J/E parallel corpus and long phrase tables

Jin’ichi Murakami; Masato Tokuhisa; Satoru Ikehara

Statistical machine translation using large J/E parallel corpus and long phrase tables

Jin’ichi Murakami, Masato Tokuhisa, Satoru Ikehara

Abstract

Our statistical machine translation system that uses large Japanese-English parallel sentences and long phrase tables is described. We collected 698,973 Japanese-English parallel sentences, and we used long phrase tables. Also, we utilized general tools for statistical machine translation, such as ”Giza++”[1], ”moses”[2], and ”training-phrasemodel.perl”[3]. We used these data and these tools, We challenge the contest for IWSLT07. In which task was the result (0.4321 BLEU) obtained.

Anthology ID:: 2007.iwslt-1.23
Volume:: Proceedings of the Fourth International Workshop on Spoken Language Translation
Month:: October 15-16
Year:: 2007
Address:: Trento, Italy
Venue:: IWSLT
SIG:: SIGSLT
Publisher:
Note:
Pages:
Language:
URL:: https://aclanthology.org/2007.iwslt-1.23
DOI:
Bibkey:
Cite (ACL):: Jin’ichi Murakami, Masato Tokuhisa, and Satoru Ikehara. 2007. Statistical machine translation using large J/E parallel corpus and long phrase tables. In Proceedings of the Fourth International Workshop on Spoken Language Translation, Trento, Italy.
Cite (Informal):: Statistical machine translation using large J/E parallel corpus and long phrase tables (Murakami et al., IWSLT 2007)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingestion-script-update/2007.iwslt-1.23.pdf

PDF Search