Automatic Extraction of Parallel Speech Corpora from Dubbed Movies

Alp Öktem, Mireia Farrús, Leo Wanner


Abstract
This paper presents a methodology to extract parallel speech corpora based on any language pair from dubbed movies, together with an application framework in which some corresponding prosodic parameters are extracted. The obtained parallel corpora are especially suitable for speech-to-speech translation applications when a prosody transfer between source and target languages is desired.
Anthology ID:
W17-2506
Volume:
Proceedings of the 10th Workshop on Building and Using Comparable Corpora
Month:
August
Year:
2017
Address:
Vancouver, Canada
Venue:
BUCC
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
31–35
Language:
URL:
https://aclanthology.org/W17-2506
DOI:
10.18653/v1/W17-2506
Bibkey:
Cite (ACL):
Alp Öktem, Mireia Farrús, and Leo Wanner. 2017. Automatic Extraction of Parallel Speech Corpora from Dubbed Movies. In Proceedings of the 10th Workshop on Building and Using Comparable Corpora, pages 31–35, Vancouver, Canada. Association for Computational Linguistics.
Cite (Informal):
Automatic Extraction of Parallel Speech Corpora from Dubbed Movies (Öktem et al., BUCC 2017)
Copy Citation:
PDF:
https://preview.aclanthology.org/remove-xml-comments/W17-2506.pdf
Presentation:
 W17-2506.Presentation.pdf