The Volctrans Neural Speech Translation System for IWSLT 2021
Chengqi Zhao, Zhicheng Liu, Jian Tong, Tao Wang, Mingxuan Wang, Rong Ye, Qianqian Dong, Jun Cao, Lei Li
Abstract
This paper describes the systems submitted to IWSLT 2021 by the Volctrans team. We participate in the offline speech translation and text-to-text simultaneous translation tracks. For offline speech translation, our best end-to-end model achieves 7.9 BLEU improvements over the benchmark on the MuST-C test set and is even approaching the results of a strong cascade solution. For text-to-text simultaneous translation, we explore the best practice to optimize the wait-k model. As a result, our final submitted systems exceed the benchmark at around 7 BLEU on the same latency regime. We release our code and model to facilitate both future research works and industrial applications.- Anthology ID:
- 2021.iwslt-1.6
- Volume:
- Proceedings of the 18th International Conference on Spoken Language Translation (IWSLT 2021)
- Month:
- August
- Year:
- 2021
- Address:
- Bangkok, Thailand (online)
- Editors:
- Marcello Federico, Alex Waibel, Marta R. Costa-jussà, Jan Niehues, Sebastian Stuker, Elizabeth Salesky
- Venue:
- IWSLT
- SIG:
- SIGSLT
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 64–74
- Language:
- URL:
- https://aclanthology.org/2021.iwslt-1.6
- DOI:
- 10.18653/v1/2021.iwslt-1.6
- Cite (ACL):
- Chengqi Zhao, Zhicheng Liu, Jian Tong, Tao Wang, Mingxuan Wang, Rong Ye, Qianqian Dong, Jun Cao, and Lei Li. 2021. The Volctrans Neural Speech Translation System for IWSLT 2021. In Proceedings of the 18th International Conference on Spoken Language Translation (IWSLT 2021), pages 64–74, Bangkok, Thailand (online). Association for Computational Linguistics.
- Cite (Informal):
- The Volctrans Neural Speech Translation System for IWSLT 2021 (Zhao et al., IWSLT 2021)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-2/2021.iwslt-1.6.pdf
- Code
- bytedance/neurst
- Data
- LibriSpeech, MuST-C