Abstract
This paper describes a new alignment method that extracts high quality multi-word alignments from sentence-aligned multilingual parallel corpora. The method can handle several languages at once. The phrase tables obtained by the method have a comparable accuracy and a higher coverage than those obtained by current methods. They are also obtained much faster.- Anthology ID:
- 2008.amta-papers.11
- Volume:
- Proceedings of the 8th Conference of the Association for Machine Translation in the Americas: Research Papers
- Month:
- October 21-25
- Year:
- 2008
- Address:
- Waikiki, USA
- Venue:
- AMTA
- SIG:
- Publisher:
- Association for Machine Translation in the Americas
- Note:
- Pages:
- 125–132
- Language:
- URL:
- https://aclanthology.org/2008.amta-papers.11
- DOI:
- Cite (ACL):
- Adrien Lardilleux and Yves Lepage. 2008. A truly multilingual, high coverage, accurate, yet simple, subsentential alignment method. In Proceedings of the 8th Conference of the Association for Machine Translation in the Americas: Research Papers, pages 125–132, Waikiki, USA. Association for Machine Translation in the Americas.
- Cite (Informal):
- A truly multilingual, high coverage, accurate, yet simple, subsentential alignment method (Lardilleux & Lepage, AMTA 2008)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/2008.amta-papers.11.pdf