2020
pdf
bib
abs
From Speech-to-Speech Translation to Automatic Dubbing
Marcello Federico
|
Robert Enyedi
|
Roberto Barra-Chicote
|
Ritwik Giri
|
Umut Isik
|
Arvindh Krishnaswamy
|
Hassan Sawaf
Proceedings of the 17th International Conference on Spoken Language Translation
We present enhancements to a speech-to-speech translation pipeline in order to perform automatic dubbing. Our architecture features neural machine translation generating output of preferred length, prosodic alignment of the translation with the original speech segments, neural text-to-speech with fine tuning of the duration of each utterance, and, finally, audio rendering to enriches text-to-speech output with background noise and reverberation extracted from the original audio. We report and discuss results of a first subjective evaluation of automatic dubbing of excerpts of TED Talks from English into Italian, which measures the perceived naturalness of automatic dubbing and the relative importance of each proposed enhancement.
2015
pdf
bib
Topic adaptation for machine translation of e-commerce content
Prashant Mathur
|
Marcello Federico
|
Selçuk Köprü
|
Sharam Khadivi
|
Hassan Sawaf
Proceedings of Machine Translation Summit XV: Papers
pdf
bib
MT quality estimation for e-commerce data
José G. C. de Souza
|
Marcello Federico
|
Hassan Sawaf
Proceedings of Machine Translation Summit XV: User Track
2014
pdf
bib
Correcting Keyboard Layout Errors and Homoglyphs in Queries
Derek Barnes
|
Mahesh Joshi
|
Hassan Sawaf
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)
2013
pdf
bib
Selective Combination of Pivot and Direct Statistical Machine Translation Models
Ahmed El Kholy
|
Nizar Habash
|
Gregor Leusch
|
Evgeny Matusov
|
Hassan Sawaf
Proceedings of the Sixth International Joint Conference on Natural Language Processing
pdf
bib
Language Independent Connectivity Strength Features for Phrase Pivot Statistical Machine Translation
Ahmed El Kholy
|
Nizar Habash
|
Gregor Leusch
|
Evgeny Matusov
|
Hassan Sawaf
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
2012
pdf
bib
abs
Automatic Speech Recognition & Hybrid MT for HQ Closed-Captioning & Subtitling for Video Broadcast
Hassan Sawaf
Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Commercial MT User Program
We describe a system to rapidly generate high-quality closed captions and subtitles for live broadcasted TV shows, using automated components, namely Automatic Speech Recognition and Machine Translation. The human stays in the loop for quality assurance and optional post-editing. We also describe how the system feeds the human edits and corrections back into the different components for improvement of these components and with that of the overall system. We finally describe the operation of this system in a real life environment within a broadcast network, where we implemented the system to transcribe, process broadcast transmissions and generate high-quality closed captions in Arabic and translate these into English subtitles in short time.
2010
pdf
bib
abs
Arabic Dialect Handling in Hybrid Machine Translation
Hassan Sawaf
Proceedings of the 9th Conference of the Association for Machine Translation in the Americas: Research Papers
In this paper, we describe an extension to a hybrid machine translation system for handling dialect Arabic, using a decoding algorithm to normalize non-standard, spontaneous and dialectal Arabic into Modern Standard Arabic. We prove the feasibility of the approach by measuring and comparing machine translation results in terms of BLEU with and without the proposed approach. We show in our tests that on real-live broadcast input with transcriptions of dialectal speech we achieve an increase on BLEU of about 1%, and on web content with dialect text of about 2%.
bib
abs
User-generated System for Critical Document Triage and Exploitation–Version 2011
Kristen Summers
|
Hassan Sawaf
Proceedings of the 9th Conference of the Association for Machine Translation in the Americas: Government MT User Program
CACI has developed and delivered systems for document exploitation and processing to Government customers around the world. Many of these systems include advanced language processing capabilities in order to enable rapid triage of vast collections of foreign language documents, separating the content that requires immediate human attention from the less immediately pressing material. AppTek provides key patent-pending Machine Translation technology for this critical process, rendering material in Arabic, Farsi and other languages into an English rendition that enables both further automated processing and rapid review by monolingual analysts, to identify the documents that require immediate linguist attention. Both CACI and AppTek have been working with customers to develop capabilities that enable them, the users, to be the ones in command of making their systems learn and continuously improve. We will describe how we put this critical user requirement into the systems and the key role that the user's involvement played in this. We will also discuss some of the key components of the system and what the customer-centric evolution of the system will be, including our document translation workflow, the machine translation technology within it, and our approaches to supporting the technology and sustaining its success designed around adapting to user needs.
2008
pdf
bib
abs
Hybrid Machine Translation Applied to Media Monitoring
Hassan Sawaf
|
Braddock Gaskill
|
Michael Veronis
Proceedings of the 8th Conference of the Association for Machine Translation in the Americas: Government and Commercial Uses of MT
2000
pdf
bib
abs
On the Use of Grammar Based Language Models for Statistical Machine Translation
Hassan Sawaf
|
Kai Schütz
|
Hermann Ney
Proceedings of the Sixth International Workshop on Parsing Technologies
In this paper, we describe some concepts of language models beyond the usually used standard trigram and use such language models for statistical machine translation. In statistical machine translation the language model is the a-priori knowledge source of the system about the target language. One important requirement for the language model is the correct word order, given a certain choice of words, and to score the translations generated by the translation model Pr(f1J/eI1), in view of the syntactic context. In addition to standard m-grams with long histories, we examine the use of Part-of-Speech based models as well as linguistically motivated grammars with stochastic parsing as a special type of language model. Translation results are given on the VERBMOBIL task, where translation is performed from German to English, with vocabulary sizes of 6500 and 4000 words, respectively.