Abstract
Disfluencies in speech pose severe difficulties in machine translation of spontaneous speech. This paper presents our conditional random field (CRF)-based speech disfluency detection system developed on German to improve spoken language translation performance. In order to detect speech disfluencies considering syntactics and semantics of speech utterances, we carried out a CRF-based approach using information learned from the word representation and the phrase table used for machine translation. The word representation is gained using recurrent neural networks and projected words are clustered using the k-means algorithm. Using the output from the model trained with the word representations and phrase table information, we achieve an improvement of 1.96 BLEU points on the lecture test set. By keeping or removing humanannotated disfluencies, we show an upper bound and lower bound of translation quality. In an oracle experiment we gain 3.16 BLEU points of improvement on the lecture test set, compared to the same set with all disfluencies.- Anthology ID:
- 2013.iwslt-papers.12
- Volume:
- Proceedings of the 10th International Workshop on Spoken Language Translation: Papers
- Month:
- December 5-6
- Year:
- 2013
- Address:
- Heidelberg, Germany
- Editor:
- Joy Ying Zhang
- Venue:
- IWSLT
- SIG:
- SIGSLT
- Publisher:
- Note:
- Pages:
- Language:
- URL:
- https://aclanthology.org/2013.iwslt-papers.12
- DOI:
- Cite (ACL):
- Eunah Cho, Than-Le Ha, and Alex Waibel. 2013. CRF-based disfluency detection using semantic features for German to English spoken language translation. In Proceedings of the 10th International Workshop on Spoken Language Translation: Papers, Heidelberg, Germany.
- Cite (Informal):
- CRF-based disfluency detection using semantic features for German to English spoken language translation (Cho et al., IWSLT 2013)
- PDF:
- https://preview.aclanthology.org/naacl-24-ws-corrections/2013.iwslt-papers.12.pdf