The HW-TSC’s Speech to Speech Translation System for IWSLT 2022 Evaluation

Jiaxin Guo, Yinglu Li, Minghan Wang, Xiaosong Qiao, Yuxia Wang, Hengchao Shang, Chang Su, Yimeng Chen, Min Zhang, Shimin Tao, Hao Yang, Ying Qin


Abstract
The paper presents the HW-TSC’s pipeline and results of Offline Speech to Speech Translation for IWSLT 2022. We design a cascade system consisted of an ASR model, machine translation model and TTS model to convert the speech from one language into another language(en-de). For the ASR part, we find that better performance can be obtained by ensembling multiple heterogeneous ASR models and performing reranking on beam candidates. And we find that the combination of context-aware reranking strategy and MT model fine-tuned on the in-domain dataset is helpful to improve the performance. Because it can mitigate the problem that the inconsistency in transcripts caused by the lack of context. Finally, we use VITS model provided officially to reproduce audio files from the translation hypothesis.
Anthology ID:
2022.iwslt-1.26
Volume:
Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022)
Month:
May
Year:
2022
Address:
Dublin, Ireland (in-person and online)
Venue:
IWSLT
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
293–297
Language:
URL:
https://aclanthology.org/2022.iwslt-1.26
DOI:
10.18653/v1/2022.iwslt-1.26
Bibkey:
Cite (ACL):
Jiaxin Guo, Yinglu Li, Minghan Wang, Xiaosong Qiao, Yuxia Wang, Hengchao Shang, Chang Su, Yimeng Chen, Min Zhang, Shimin Tao, Hao Yang, and Ying Qin. 2022. The HW-TSC’s Speech to Speech Translation System for IWSLT 2022 Evaluation. In Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022), pages 293–297, Dublin, Ireland (in-person and online). Association for Computational Linguistics.
Cite (Informal):
The HW-TSC’s Speech to Speech Translation System for IWSLT 2022 Evaluation (Guo et al., IWSLT 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2022.iwslt-1.26.pdf
Data
LibriSpeechTED-LIUM 3