The HW-TSC’s Simultaneous Speech-to-Speech Translation System for IWSLT 2023 Evaluation

Hengchao Shang; Zhiqiang Rao; Zongyao Li; Zhanglin Wu; Jiaxin Guo; Minghan Wang; Daimeng Wei; Shaojun Li; Zhengzhe Yu; Xiaoyu Chen; Lizhi Lei; Hao Yang

doi:10.18653/v1/2023.iwslt-1.36

The HW-TSC’s Simultaneous Speech-to-Speech Translation System for IWSLT 2023 Evaluation

Hengchao Shang, Zhiqiang Rao, Zongyao Li, Zhanglin Wu, Jiaxin Guo, Minghan Wang, Daimeng Wei, Shaojun Li, Zhengzhe Yu, Xiaoyu Chen, Lizhi Lei, Hao Yang

Abstract

In this paper, we present our submission to the IWSLT 2023 Simultaneous Speech-to-Speech Translation competition. Our participation involves three language directions: English-German, English-Chinese, and English-Japanese. Our solution is a cascaded incremental decoding system, consisting of an ASR model, an MT model, and a TTS model. By adopting the strategies used in the Speech-to-Text track, we have managed to generate a more confident target text for each audio segment input, which can guide the next MT incremental decoding process. Additionally, we have integrated the TTS model to seamlessly reproduce audio files from the translation hypothesis. To enhance the effectiveness of our experiment, we have utilized a range of methods to reduce error conditions in the TTS input text and improve the smoothness of the TTS output audio.

Anthology ID:: 2023.iwslt-1.36
Volume:: Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023)
Month:: July
Year:: 2023
Address:: Toronto, Canada (in-person and online)
Editors:: Elizabeth Salesky, Marcello Federico, Marine Carpuat
Venue:: IWSLT
SIG:: SIGSLT
Publisher:: Association for Computational Linguistics
Note:
Pages:: 383–388
Language:
URL:: https://aclanthology.org/2023.iwslt-1.36
DOI:: 10.18653/v1/2023.iwslt-1.36
Bibkey:
Cite (ACL):: Hengchao Shang, Zhiqiang Rao, Zongyao Li, Zhanglin Wu, Jiaxin Guo, Minghan Wang, Daimeng Wei, Shaojun Li, Zhengzhe Yu, Xiaoyu Chen, Lizhi Lei, and Hao Yang. 2023. The HW-TSC’s Simultaneous Speech-to-Speech Translation System for IWSLT 2023 Evaluation. In Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), pages 383–388, Toronto, Canada (in-person and online). Association for Computational Linguistics.
Cite (Informal):: The HW-TSC’s Simultaneous Speech-to-Speech Translation System for IWSLT 2023 Evaluation (Shang et al., IWSLT 2023)
Copy Citation:
PDF:: https://preview.aclanthology.org/naacl-24-ws-corrections/2023.iwslt-1.36.pdf

PDF Search