System Description on Automatic Simultaneous Translation Workshop

Zecheng Li, Yue Sun, Haoze Li


Abstract
This paper describes our system submitted to the Third Workshop on Automatic Simultaneous Translation at NAACL 2022. We participate in the Chinese audio to English text direction of Chinese-to-English translation. Our speech-to-text system is a pipeline in which we use rhymological features for audio segmentation, the ASRT model for speech recognition, and the STACL model for streaming text translation. To translate streaming text, we use a wait-k policy, trained to generate the target sentence concurrently with the source sentence but always k words behind. Our system is competitive and ranked third in the audio input track. The code will be released soon.
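The wait-k schedule mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' STACL implementation: the function name `wait_k_decode` and the callback `predict_next` are hypothetical stand-ins for a trained prefix-to-prefix translation model. The sketch only shows the read/write schedule, in which target word t is produced after reading the first t + k - 1 source words (or the whole source, once it has ended).

```python
from typing import Callable, Iterable, List, Tuple

EOS = "</s>"  # assumed end-of-sentence marker for this sketch


def wait_k_decode(
    source: Iterable[str],
    k: int,
    predict_next: Callable[[List[str], List[str]], str],
    max_len: int = 100,
) -> List[Tuple[str, int]]:
    """Return (target_word, source_words_read) pairs under a wait-k schedule.

    predict_next is a hypothetical model call: given the source prefix read
    so far and the target prefix written so far, it returns the next word.
    """
    stream = iter(source)
    source_prefix: List[str] = []
    target: List[str] = []
    output: List[Tuple[str, int]] = []
    exhausted = False

    while len(target) < max_len:
        # Target step t = len(target) + 1 may see t + k - 1 source words.
        needed = len(target) + k
        while not exhausted and len(source_prefix) < needed:
            try:
                source_prefix.append(next(stream))
            except StopIteration:
                exhausted = True  # source ended: decode the tail with full context
        word = predict_next(source_prefix, target)
        if word == EOS:
            break
        target.append(word)
        output.append((word, len(source_prefix)))
    return output


def toy_model(source_prefix: List[str], target_prefix: List[str]) -> str:
    """Toy stand-in for a translation model: 'translates' by upper-casing."""
    if len(target_prefix) < len(source_prefix):
        return source_prefix[len(target_prefix)].upper()
    return EOS


if __name__ == "__main__":
    for word, read in wait_k_decode(["a", "b", "c", "d"], k=2, predict_next=toy_model):
        print(f"wrote {word!r} after reading {read} source words")
```

With k = 2 the toy run writes its first word only after two source words have arrived and then stays one word behind until the source ends, at which point the remaining target words are produced with the full source available.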
Anthology ID:
2022.autosimtrans-1.3
Volume:
Proceedings of the Third Workshop on Automatic Simultaneous Translation
Month:
July
Year:
2022
Address:
Online
Venue:
AutoSimTrans
Publisher:
Association for Computational Linguistics
Pages:
18–21
URL:
https://aclanthology.org/2022.autosimtrans-1.3
DOI:
10.18653/v1/2022.autosimtrans-1.3
Cite (ACL):
Zecheng Li, Yue Sun, and Haoze Li. 2022. System Description on Automatic Simultaneous Translation Workshop. In Proceedings of the Third Workshop on Automatic Simultaneous Translation, pages 18–21, Online. Association for Computational Linguistics.
Cite (Informal):
System Description on Automatic Simultaneous Translation Workshop (Li et al., AutoSimTrans 2022)
PDF:
https://preview.aclanthology.org/auto-file-uploads/2022.autosimtrans-1.3.pdf
Data:
AISHELL-1, THCHS-30