2022
pdf
abs
The NiuTrans’s Submission to the IWSLT22 English-to-Chinese Offline Speech Translation Task
Yuhao Zhang
|
Canan Huang
|
Chen Xu
|
Xiaoqian Liu
|
Bei Li
|
Anxiang Ma
|
Tong Xiao
|
Jingbo Zhu
Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022)
This paper describes NiuTrans’s submission to the IWSLT22 English-to-Chinese (En-Zh) offline speech translation task. The end-to-end and bilingual system is built by constrained English and Chinese data and translates the English speech to Chinese text without intermediate transcription. Our speech translation models are composed of different pre-trained acoustic models and machine translation models by two kinds of adapters. We compared the effect of the standard speech feature (e.g. log Mel-filterbank) and the pre-training speech feature and try to make them interact. The final submission is an ensemble of three potential speech translation models. Our single best and ensemble model achieves 18.66 BLEU and 19.35 BLEU separately on MuST-C En-Zh tst-COMMON set.
2021
pdf
abs
The NiuTrans Machine Translation Systems for WMT21
Shuhan Zhou
|
Tao Zhou
|
Binghao Wei
|
Yingfeng Luo
|
Yongyu Mu
|
Zefan Zhou
|
Chenglong Wang
|
Xuanjun Zhou
|
Chuanhao Lv
|
Yi Jing
|
Laohu Wang
|
Jingnan Zhang
|
Canan Huang
|
Zhongxiang Yan
|
Chi Hu
|
Bei Li
|
Tong Xiao
|
Jingbo Zhu
Proceedings of the Sixth Conference on Machine Translation
This paper describes NiuTrans neural machine translation systems of the WMT 2021 news translation tasks. We made submissions to 9 language directions, including English2Chinese, Japanese, Russian, Icelandic and English2Hausa tasks. Our primary systems are built on several effective variants of Transformer, e.g., Transformer-DLCL, ODE-Transformer. We also utilize back-translation, knowledge distillation, post-ensemble, and iterative fine-tuning techniques to enhance the model performance further.
pdf
abs
The NiuTrans End-to-End Speech Translation System for IWSLT 2021 Offline Task
Chen Xu
|
Xiaoqian Liu
|
Xiaowen Liu
|
Tiger Wang
|
Canan Huang
|
Tong Xiao
|
Jingbo Zhu
Proceedings of the 18th International Conference on Spoken Language Translation (IWSLT 2021)
This paper describes the submission of the NiuTrans end-to-end speech translation system for the IWSLT 2021 offline task, which translates from the English audio to German text directly without intermediate transcription. We use the Transformer-based model architecture and enhance it by Conformer, relative position encoding, and stacked acoustic and textual encoding. To augment the training data, the English transcriptions are translated to German translations. Finally, we employ ensemble decoding to integrate the predictions from several models trained with the different datasets. Combining these techniques, we achieve 33.84 BLEU points on the MuST-C En-De test set, which shows the enormous potential of the end-to-end model.