Statistical machine translation using large J/E parallel corpus and long phrase tables

Jin’ichi Murakami, Masato Tokuhisa, Satoru Ikehara


Abstract
Our statistical machine translation system that uses large Japanese-English parallel sentences and long phrase tables is described. We collected 698,973 Japanese-English parallel sentences, and we used long phrase tables. Also, we utilized general tools for statistical machine translation, such as ”Giza++”[1], ”moses”[2], and ”training-phrasemodel.perl”[3]. We used these data and these tools, We challenge the contest for IWSLT07. In which task was the result (0.4321 BLEU) obtained.
Anthology ID:
2007.iwslt-1.23
Volume:
Proceedings of the Fourth International Workshop on Spoken Language Translation
Month:
October 15-16
Year:
2007
Address:
Trento, Italy
Venue:
IWSLT
SIG:
SIGSLT
Publisher:
Note:
Pages:
Language:
URL:
https://aclanthology.org/2007.iwslt-1.23
DOI:
Bibkey:
Cite (ACL):
Jin’ichi Murakami, Masato Tokuhisa, and Satoru Ikehara. 2007. Statistical machine translation using large J/E parallel corpus and long phrase tables. In Proceedings of the Fourth International Workshop on Spoken Language Translation, Trento, Italy.
Cite (Informal):
Statistical machine translation using large J/E parallel corpus and long phrase tables (Murakami et al., IWSLT 2007)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2007.iwslt-1.23.pdf