The Johns Hopkins University 2003 Chinese-English machine translation system

W. Byrne, S. Khudanpur, W. Kim, S. Kumar, P. Pecina, P. Virga, P. Xu, D. Yarowsky


Abstract
We describe a Chinese to English Machine Translation system developed at the Johns Hopkins University for the NIST 2003 MT evaluation. The system is based on a Weighted Finite State Transducer implementation of the alignment template translation model for statistical machine translation. The baseline MT system was trained using 100,000 sentence pairs selected from a static bitext training collection. Information retrieval techniques were then used to create specific training collections for each document to be translated. This document-specific training set included bitext and name entities that were then added to the baseline system by augmenting the library of alignment templates. We report translation performance of baseline and IR-based systems on two NIST MT evaluation test sets.
Anthology ID:
2003.mtsummit-systems.3
Volume:
Proceedings of Machine Translation Summit IX: System Presentations
Month:
September 23-27
Year:
2003
Address:
New Orleans, USA
Venue:
MTSummit
SIG:
Publisher:
Note:
Pages:
Language:
URL:
https://aclanthology.org/2003.mtsummit-systems.3
DOI:
Bibkey:
Cite (ACL):
W. Byrne, S. Khudanpur, W. Kim, S. Kumar, P. Pecina, P. Virga, P. Xu, and D. Yarowsky. 2003. The Johns Hopkins University 2003 Chinese-English machine translation system. In Proceedings of Machine Translation Summit IX: System Presentations, New Orleans, USA.
Cite (Informal):
The Johns Hopkins University 2003 Chinese-English machine translation system (Byrne et al., MTSummit 2003)
Copy Citation:
PDF:
https://preview.aclanthology.org/update-css-js/2003.mtsummit-systems.3.pdf