Quality Analysis of Patent Parallel Corpus by the Scale
Isamu Okada, Shinichiro Miyazawa, Kazunari Ishida, Nobuhiko Shimizu, Toshizumi Ohta
Abstract
Large-scale parallel corpus is extremely important for translation memory, example-based machine translation, and the support system to create English sentences. Organized collection or establishment of large-scale corpus is currently ongoing; however it is a difficult project in terms of copyrights as well as economic efficiency. To investigate general tendency of large-scale corpus helps to improve economical efficiency of parallel corpus collection as well as system establishment. In this study, therefore, the relationship between the scale of parallel corpus and the degree of correspondence is clarified, using parallel corpus for patents.- Anthology ID:
- 2005.mtsummit-wpt.5
- Volume:
- Workshop on patent translation
- Month:
- September 13-15
- Year:
- 2005
- Address:
- Phuket, Thailand
- Venue:
- MTSummit
- SIG:
- Publisher:
- Note:
- Pages:
- 29–34
- Language:
- URL:
- https://aclanthology.org/2005.mtsummit-wpt.5
- DOI:
- Cite (ACL):
- Isamu Okada, Shinichiro Miyazawa, Kazunari Ishida, Nobuhiko Shimizu, and Toshizumi Ohta. 2005. Quality Analysis of Patent Parallel Corpus by the Scale. In Workshop on patent translation, pages 29–34, Phuket, Thailand.
- Cite (Informal):
- Quality Analysis of Patent Parallel Corpus by the Scale (Okada et al., MTSummit 2005)
- PDF:
- https://preview.aclanthology.org/paclic-22-ingestion/2005.mtsummit-wpt.5.pdf