Abstract
We present the University of Edinburgh’s submission for the IWSLT 2007 shared task. Our efforts focused on adapting our statistical machine translation system to the open data conditions for the Italian-English task of the evaluation campaign. We examine the challenges of building a system with a limited set of in-domain development data (SITAL), a small training corpus in a related but distinct domain (BTEC), and a large out of domain corpus (Europarl). We concentrated on the corrected text track, and present additional results of our experiments using the open-source Moses MT system with speech input.- Anthology ID:
- 2007.iwslt-1.6
- Volume:
- Proceedings of the Fourth International Workshop on Spoken Language Translation
- Month:
- October 15-16
- Year:
- 2007
- Address:
- Trento, Italy
- Venue:
- IWSLT
- SIG:
- SIGSLT
- Publisher:
- Note:
- Pages:
- Language:
- URL:
- https://aclanthology.org/2007.iwslt-1.6
- DOI:
- Cite (ACL):
- Josh Schroeder and Philipp Koehn. 2007. The University of Edinburgh system description for IWSLT 2007. In Proceedings of the Fourth International Workshop on Spoken Language Translation, Trento, Italy.
- Cite (Informal):
- The University of Edinburgh system description for IWSLT 2007 (Schroeder & Koehn, IWSLT 2007)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-2/2007.iwslt-1.6.pdf