Abstract
This paper presents an attempt at developing a technique of acquiring translation pairs of technical terms with sufficiently high precision from parallel patent documents. The approach taken in the proposed technique is based on integrating the phrase translation table of a state-of-the-art statistical phrase-based machine translation model, and compositional translation generation based on an existing bilingual lexicon for human use. Our evaluation results clearly show that the agreement between the two individual techniques definitely contribute to improving precision of translation candidates. We then apply the Support Vector Machines (SVMs) to the task of automatically validating translation candidates in the phrase translation table. Experimental evaluation results again show that the SVMs based approach to translation candidates validation can contribute to improving the precision of translation candidates in the phrase translation table.- Anthology ID:
- 2008.amta-papers.14
- Volume:
- Proceedings of the 8th Conference of the Association for Machine Translation in the Americas: Research Papers
- Month:
- October 21-25
- Year:
- 2008
- Address:
- Waikiki, USA
- Venue:
- AMTA
- SIG:
- Publisher:
- Association for Machine Translation in the Americas
- Note:
- Pages:
- 153–162
- Language:
- URL:
- https://aclanthology.org/2008.amta-papers.14
- DOI:
- Cite (ACL):
- Yohei Morishita, Takehito Utsuro, and Mikio Yamamoto. 2008. Integrating a Phrase-based SMT Model and a Bilingual Lexicon for Semi-Automatic Acquisition of Technical Term Translation Lexicons. In Proceedings of the 8th Conference of the Association for Machine Translation in the Americas: Research Papers, pages 153–162, Waikiki, USA. Association for Machine Translation in the Americas.
- Cite (Informal):
- Integrating a Phrase-based SMT Model and a Bilingual Lexicon for Semi-Automatic Acquisition of Technical Term Translation Lexicons (Morishita et al., AMTA 2008)
- PDF:
- https://preview.aclanthology.org/add_acl24_videos/2008.amta-papers.14.pdf