A method for translation of paralinguistic information
Takatomo Kano, Sakriani Sakti, Shinnosuke Takamichi, Graham Neubig, Tomoki Toda, Satoshi Nakamura
Abstract
This paper is concerned with speech-to-speech translation that is sensitive to paralinguistic information. From the many different possible paralinguistic features to handle, in this paper we chose duration and power as a first step, proposing a method that can translate these features from input speech to the output speech in continuous space. This is done in a simple and language-independent fashion by training a regression model that maps source language duration and power information into the target language. We evaluate the proposed method on a digit translation task and show that paralinguistic information in input speech appears in output speech, and that this information can be used by target language speakers to detect emphasis.
- Anthology ID:
- 2012.iwslt-papers.2
- Volume:
- Proceedings of the 9th International Workshop on Spoken Language Translation: Papers
- Month:
- December 6-7
- Year:
- 2012
- Address:
- Hong Kong
- Venue:
- IWSLT
- SIG:
- SIGSLT
- Pages:
- 158–163
- URL:
- https://aclanthology.org/2012.iwslt-papers.2
- Cite (ACL):
- Takatomo Kano, Sakriani Sakti, Shinnosuke Takamichi, Graham Neubig, Tomoki Toda, and Satoshi Nakamura. 2012. A method for translation of paralinguistic information. In Proceedings of the 9th International Workshop on Spoken Language Translation: Papers, pages 158–163, Hong Kong.
- Cite (Informal):
- A method for translation of paralinguistic information (Kano et al., IWSLT 2012)
- PDF:
- https://preview.aclanthology.org/add_acl24_videos/2012.iwslt-papers.2.pdf