Example-based machine translation based on the synchronous SSTC annotation schema

Mosleh H. Al-Adhaileh, Tang Enya Kong


Abstract
In this paper, we describe an Example-Based Machine Translation (EBMT) system for English-Malay translation. Our approach is an example-based approach which relies sorely on example translations kept in a Bilingual Knowledge Bank (BKB). In our approach, a flexible annotation schema called Structured String-Tree Correspondence (SSTC) is used to annotate both the source and target sentences of a translation pair. Each SSTC describes a sentence, a representation tree as well as the correspondences between substrings in the sentence and subtrees in the representation tree. With both the source and target SSTCs established, a translation example in the BKB can then be represented effectively in terms of a pair of synchronous SSTCs. In the process of translation, we first try to build the representation tree for the source sentence (English) based on the example-based parsing algorithm as presented in [1]. By referring to the resultant source parse tree, we then proceed to synthesis the target sentence (Malay) based on the target SSTCs as pointed to by the synchronous SSTCs which encode the relationship between source and target SSTCs.
Anthology ID:
1999.mtsummit-1.36
Volume:
Proceedings of Machine Translation Summit VII
Month:
September 13-17
Year:
1999
Address:
Singapore, Singapore
Venue:
MTSummit
SIG:
Publisher:
Note:
Pages:
244–249
Language:
URL:
https://aclanthology.org/1999.mtsummit-1.36
DOI:
Bibkey:
Cite (ACL):
Mosleh H. Al-Adhaileh and Tang Enya Kong. 1999. Example-based machine translation based on the synchronous SSTC annotation schema. In Proceedings of Machine Translation Summit VII, pages 244–249, Singapore, Singapore.
Cite (Informal):
Example-based machine translation based on the synchronous SSTC annotation schema (Al-Adhaileh & Kong, MTSummit 1999)
Copy Citation:
PDF:
https://preview.aclanthology.org/auto-file-uploads/1999.mtsummit-1.36.pdf