Machine Translation for Triage and Exploitation of Massive Text Data

James E. Andrews, Kristen Summers


Abstract
The National Ground Intelligence Center (NGIC) collects massive quantities of textual data in foreign languages. To support exploitation in light of intelligence requirements, a triage process must be applied to this data as those requirements emerge, to identify the most useful data for further exploitation. Machine translation provides critical support for this triage. This paper outlines the types of collected data and the different challenges they present for machine translation, as well as the types of triage to support for collections of this nature, and the issues raised for machine translation by those uses.
Anthology ID:
2008.amta-govandcom.1
Volume:
Proceedings of the 8th Conference of the Association for Machine Translation in the Americas: Government and Commercial Uses of MT
Month:
October 21-25
Year:
2008
Address:
Waikiki, USA
Venue:
AMTA
SIG:
Publisher:
Association for Machine Translation in the Americas
Note:
Pages:
291–298
Language:
URL:
https://aclanthology.org/2008.amta-govandcom.1
DOI:
Bibkey:
Cite (ACL):
James E. Andrews and Kristen Summers. 2008. Machine Translation for Triage and Exploitation of Massive Text Data. In Proceedings of the 8th Conference of the Association for Machine Translation in the Americas: Government and Commercial Uses of MT, pages 291–298, Waikiki, USA. Association for Machine Translation in the Americas.
Cite (Informal):
Machine Translation for Triage and Exploitation of Massive Text Data (Andrews & Summers, AMTA 2008)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-4/2008.amta-govandcom.1.pdf