Parallel Corpus Development at NVTC

Carol Van Ess-Dykema, Laurie Gerber


Abstract
In this paper, we describe the methods used to develop an exchangeable translation memory bank of sentence-aligned Mandarin Chinese-English sentence pairs. This work is part of a larger effort, initiated by the National Virtual Translation Center (NVTC), to foster collaboration and the sharing of translation memory banks across the Intelligence Community and the Department of Defense. We describe our largely automated corpus creation process and highlight the human interventions that are still deemed necessary. We conclude with a brief discussion of how this work will affect plans for NVTC's new translation management workflow, and of future research to improve the performance of the automated components of the corpus creation process.
Anthology ID:
2010.amta-government.19
Volume:
Proceedings of the 9th Conference of the Association for Machine Translation in the Americas: Government MT User Program
Month:
October 31-November 4
Year:
2010
Address:
Denver, Colorado, USA
Venue:
AMTA
Publisher:
Association for Machine Translation in the Americas
URL:
https://aclanthology.org/2010.amta-government.19
Cite (ACL):
Carol Van Ess-Dykema and Laurie Gerber. 2010. Parallel Corpus Development at NVTC. In Proceedings of the 9th Conference of the Association for Machine Translation in the Americas: Government MT User Program, Denver, Colorado, USA. Association for Machine Translation in the Americas.
Cite (Informal):
Parallel Corpus Development at NVTC (Van Ess-Dykema & Gerber, AMTA 2010)
PDF:
https://preview.aclanthology.org/auto-file-uploads/2010.amta-government.19.pdf