Quality Analysis of Patent Parallel Corpus by the Scale

Isamu Okada, Shinichiro Miyazawa, Kazunari Ishida, Nobuhiko Shimizu, Toshizumi Ohta


Abstract
Large-scale parallel corpus is extremely important for translation memory, example-based machine translation, and the support system to create English sentences. Organized collection or establishment of large-scale corpus is currently ongoing; however it is a difficult project in terms of copyrights as well as economic efficiency. To investigate general tendency of large-scale corpus helps to improve economical efficiency of parallel corpus collection as well as system establishment. In this study, therefore, the relationship between the scale of parallel corpus and the degree of correspondence is clarified, using parallel corpus for patents.
Anthology ID:
2005.mtsummit-wpt.5
Volume:
Workshop on patent translation
Month:
September 13-15
Year:
2005
Address:
Phuket, Thailand
Venue:
MTSummit
SIG:
Publisher:
Note:
Pages:
29–34
Language:
URL:
https://aclanthology.org/2005.mtsummit-wpt.5
DOI:
Bibkey:
Cite (ACL):
Isamu Okada, Shinichiro Miyazawa, Kazunari Ishida, Nobuhiko Shimizu, and Toshizumi Ohta. 2005. Quality Analysis of Patent Parallel Corpus by the Scale. In Workshop on patent translation, pages 29–34, Phuket, Thailand.
Cite (Informal):
Quality Analysis of Patent Parallel Corpus by the Scale (Okada et al., MTSummit 2005)
Copy Citation:
PDF:
https://preview.aclanthology.org/paclic-22-ingestion/2005.mtsummit-wpt.5.pdf