Abstract
This paper presents a methodology to extract parallel speech corpora based on any language pair from dubbed movies, together with an application framework in which some corresponding prosodic parameters are extracted. The obtained parallel corpora are especially suitable for speech-to-speech translation applications when a prosody transfer between source and target languages is desired.- Anthology ID:
- W17-2506
- Volume:
- Proceedings of the 10th Workshop on Building and Using Comparable Corpora
- Month:
- August
- Year:
- 2017
- Address:
- Vancouver, Canada
- Venue:
- BUCC
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 31–35
- Language:
- URL:
- https://aclanthology.org/W17-2506
- DOI:
- 10.18653/v1/W17-2506
- Cite (ACL):
- Alp Öktem, Mireia Farrús, and Leo Wanner. 2017. Automatic Extraction of Parallel Speech Corpora from Dubbed Movies. In Proceedings of the 10th Workshop on Building and Using Comparable Corpora, pages 31–35, Vancouver, Canada. Association for Computational Linguistics.
- Cite (Informal):
- Automatic Extraction of Parallel Speech Corpora from Dubbed Movies (Öktem et al., BUCC 2017)
- PDF:
- https://preview.aclanthology.org/remove-xml-comments/W17-2506.pdf