Machine translation of multi-party meetings: segmentation and disfluency removal strategies

Eunah Cho, Jan Niehues, Alex Waibel


Abstract
Translating meetings presents a challenge since multi-speaker speech shows a variety of disfluencies. In this paper we investigate the importance of transforming speech into well-written input prior to translating multi-party meetings. We first analyze the characteristics of this data and establish oracle scores. Sentence segmentation and punctuation are performed using a language model, turn information, or a monolingual translation system. Disfluencies are removed by a CRF model trained on in-domain and out-of-domain data. For comparison, we build a combined CRF model for punctuation insertion and disfluency removal. By applying these models, multi-party meetings are transformed into fluent input for machine translation. We evaluate the models with regard to translation performance and are able to achieve an improvement of 2.1 to 4.9 BLEU points depending on the availability of turn information.
Anthology ID:
2014.iwslt-papers.4
Volume:
Proceedings of the 11th International Workshop on Spoken Language Translation: Papers
Month:
December 4-5
Year:
2014
Address:
Lake Tahoe, California
Venue:
IWSLT
SIG:
Publisher:
Note:
Pages:
176–183
Language:
URL:
https://aclanthology.org/2014.iwslt-papers.4
DOI:
Bibkey:
Cite (ACL):
Eunah Cho, Jan Niehues, and Alex Waibel. 2014. Machine translation of multi-party meetings: segmentation and disfluency removal strategies. In Proceedings of the 11th International Workshop on Spoken Language Translation: Papers, pages 176–183, Lake Tahoe, California.
Cite (Informal):
Machine translation of multi-party meetings: segmentation and disfluency removal strategies (Cho et al., IWSLT 2014)
Copy Citation:
PDF:
https://preview.aclanthology.org/update-css-js/2014.iwslt-papers.4.pdf