The Annotation Scheme of English-Chinese Clause Alignment Corpus

Shili Ge, Xiaopin Lin, Rou Song


Abstract
A clause complex consists of clauses connected by component-sharing relations and logic-semantic relations. Hence, structural transformations at the clause-complex level in translation concern how the expression of these two types of relations is adjusted. This paper designs a formal scheme for tagging structural transformations in English-Chinese translation. The annotation scheme includes three steps that operate on two grammatical levels: parsing an English clause complex into constructs and assembling the construct translations, both on the clause-complex level; and translating the constructs independently on the clause level. The assembling step involves two operations: performing operation functions and inserting Chinese words. The corpus annotation shows that it is feasible to divide structural transformations in English-Chinese translation into two levels. The corpus, which formally lays out the operations of clause-complex level structural transformations, would help to improve the end-to-end translation of complicated sentences.
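The three steps described in the abstract can be pictured with a minimal Python sketch. It is an illustration under stated assumptions, not the paper's actual tag set or tooling: the `Annotation` record, its `order` and `insertions` fields, the `assemble` function, and the toy lexicon are all hypothetical names introduced here. Reordering stands in for an "operation function", and the example sentence is invented.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Annotation:
    """Hypothetical record for one annotated English clause complex."""
    constructs: List[str]        # step 1: constructs parsed from the clause complex
    order: List[int]             # assembling operation (assumed): output order of constructs
    insertions: Dict[int, str]   # assembling operation: Chinese words inserted before a slot


def assemble(ann: Annotation, translate: Callable[[str], str]) -> str:
    # Step 2 (clause level): translate each construct independently.
    translated = [translate(c) for c in ann.constructs]
    # Step 3 (clause-complex level): reorder translations and insert Chinese words.
    pieces: List[str] = []
    for slot, idx in enumerate(ann.order):
        if slot in ann.insertions:
            pieces.append(ann.insertions[slot])
        pieces.append(translated[idx])
    return "".join(pieces)


# Toy lexicon standing in for a clause-level translator (an assumption).
toy_lexicon = {"He left": "他离开了", "it rained": "下雨了"}

ann = Annotation(
    constructs=["He left", "it rained"],   # from "He left because it rained."
    order=[1, 0],                          # cause first, as is common in Chinese
    insertions={0: "因为", 1: "，所以"},     # inserted connectives
)
result = assemble(ann, toy_lexicon.__getitem__)
print(result)
```

Keeping the clause-level translation separate from the assembling data mirrors the two-level division the abstract argues for: the `translate` callable never sees the clause-complex structure, and the reordering and insertions never touch the inside of a construct.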
Anthology ID:
2020.ccl-1.96
Volume:
Proceedings of the 19th Chinese National Conference on Computational Linguistics
Month:
October
Year:
2020
Address:
Haikou, China
Venue:
CCL
Publisher:
Chinese Information Processing Society of China
Pages:
1035–1044
Language:
English
URL:
https://aclanthology.org/2020.ccl-1.96
Cite (ACL):
Shili Ge, Xiaopin Lin, and Rou Song. 2020. The Annotation Scheme of English-Chinese Clause Alignment Corpus. In Proceedings of the 19th Chinese National Conference on Computational Linguistics, pages 1035–1044, Haikou, China. Chinese Information Processing Society of China.
Cite (Informal):
The Annotation Scheme of English-Chinese Clause Alignment Corpus (Ge et al., CCL 2020)
PDF:
https://preview.aclanthology.org/auto-file-uploads/2020.ccl-1.96.pdf