A method for translation of paralinguistic information

Takatomo Kano, Sakriani Sakti, Shinnosuke Takamichi, Graham Neubig, Tomoki Toda, Satoshi Nakamura


Abstract
This paper is concerned with speech-to-speech translation that is sensitive to paralinguistic information. From the many different possible paralinguistic features to handle, in this paper we chose duration and power as a first step, proposing a method that can translate these features from input speech to the output speech in continuous space. This is done in a simple and language-independent fashion by training a regression model that maps source language duration and power information into the target language. We evaluate the proposed method on a digit translation task and show that paralinguistic information in input speech appears in output speech, and that this information can be used by target language speakers to detect emphasis.
Anthology ID:
2012.iwslt-papers.2
Volume:
Proceedings of the 9th International Workshop on Spoken Language Translation: Papers
Month:
December 6-7
Year:
2012
Address:
Hong Kong, Table of contents
Venue:
IWSLT
SIG:
SIGSLT
Publisher:
Note:
Pages:
158–163
Language:
URL:
https://aclanthology.org/2012.iwslt-papers.2
DOI:
Bibkey:
Cite (ACL):
Takatomo Kano, Sakriani Sakti, Shinnosuke Takamichi, Graham Neubig, Tomoki Toda, and Satoshi Nakamura. 2012. A method for translation of paralinguistic information. In Proceedings of the 9th International Workshop on Spoken Language Translation: Papers, pages 158–163, Hong Kong, Table of contents.
Cite (Informal):
A method for translation of paralinguistic information (Kano et al., IWSLT 2012)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2012.iwslt-papers.2.pdf