@inproceedings{cheng-etal-2019-breaking,
    title = "Breaking the Data Barrier: Towards Robust Speech Translation via Adversarial Stability Training",
    author = "Cheng, Qiao  and
      Fan, Meiyuan  and
      Han, Yaqian  and
      Huang, Jin  and
      Duan, Yitao",
    editor = {Niehues, Jan  and
      Cattoni, Rolando  and
      St{\"u}ker, Sebastian  and
      Negri, Matteo  and
      Turchi, Marco  and
      Ha, Thanh-Le  and
      Salesky, Elizabeth  and
      Sanabria, Ramon  and
      Barrault, Loic  and
      Specia, Lucia  and
      Federico, Marcello},
    booktitle = "Proceedings of the 16th International Conference on Spoken Language Translation",
    month = nov # " 2-3",
    year = "2019",
    address = "Hong Kong",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/ingest-emnlp/2019.iwslt-1.29/",
    abstract = "In a pipeline speech translation system, automatic speech recognition (ASR) system will transmit errors in recognition to the downstream machine translation (MT) system. A standard machine translation system is usually trained on parallel corpus composed of clean text and will perform poorly on text with recognition noise, a gap well known in speech translation community. In this paper, we propose a training architecture which aims at making a neural machine translation model more robust against speech recognition errors. Our approach addresses the encoder and the decoder simultaneously using adversarial learning and data augmentation, respectively. Experimental results on IWSLT2018 speech translation task show that our approach can bridge the gap between the ASR output and the MT input, outperforms the baseline by up to 2.83 BLEU on noisy ASR output, while maintaining close performance on clean text."
}Markdown (Informal)
[Breaking the Data Barrier: Towards Robust Speech Translation via Adversarial Stability Training](https://preview.aclanthology.org/ingest-emnlp/2019.iwslt-1.29/) (Cheng et al., IWSLT 2019)
ACL