Automatic Speech Recognition & Hybrid MT for HQ Closed-Captioning & Subtitling for Video Broadcast

Hassan Sawaf


Abstract
We describe a system for rapidly generating high-quality closed captions and subtitles for live broadcast TV shows using automated components, namely Automatic Speech Recognition and Machine Translation. A human stays in the loop for quality assurance and optional post-editing. We also describe how the system feeds human edits and corrections back into the individual components to improve them and, in turn, the overall system. Finally, we describe the operation of this system in a real-life environment within a broadcast network, where we implemented it to transcribe and process broadcast transmissions, generate high-quality closed captions in Arabic, and translate these into English subtitles within a short time.
Anthology ID:
2012.amta-commercial.15
Volume:
Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Commercial MT User Program
Month:
October 28-November 1
Year:
2012
Address:
San Diego, California, USA
Venue:
AMTA
Publisher:
Association for Machine Translation in the Americas
URL:
https://aclanthology.org/2012.amta-commercial.15
Cite (ACL):
Hassan Sawaf. 2012. Automatic Speech Recognition & Hybrid MT for HQ Closed-Captioning & Subtitling for Video Broadcast. In Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Commercial MT User Program, San Diego, California, USA. Association for Machine Translation in the Americas.
Cite (Informal):
Automatic Speech Recognition & Hybrid MT for HQ Closed-Captioning & Subtitling for Video Broadcast (Sawaf, AMTA 2012)
PDF:
https://preview.aclanthology.org/update-css-js/2012.amta-commercial.15.pdf