Iterative Alignment of Syntactic Structures for a Bilingual Corpus

Ralph Grishman


Abstract
Alignment of parallel bilingual corpora at the level of syntactic structure holds the promise of being able to discover detailed bilingual structural correspondences automatically. This paper describes a procedure for the alignment of regularized syntactic structures, proceeding bottom-up through the trees. It makes use of information about possible lexical correspondences, from a bilingual dictionary, to generate initial candidate alignments. We consider in particular how much dictionary coverage is needed for the alignment process, and how the alignment can be iteratively improved by having an initial alignment generate additional lexical correspondences for the dictionary, and then using this augmented dictionary for subsequent alignment passes.
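The iterative loop described in the abstract can be illustrated with a toy sketch. This is not the paper's procedure: the `Node` class, the `align` and `iterate` functions, the greedy child pairing, the simple match-count score, and the Spanish-English example data are all assumptions made for illustration. Trees are scored bottom-up using a seed dictionary, node pairs that align on the strength of their children are added as new lexical correspondences, and the augmented dictionary is used on the next pass.

```python
from __future__ import annotations
from dataclasses import dataclass, field


@dataclass
class Node:
    """Hypothetical regularized tree node: a head lemma plus child subtrees."""
    head: str
    children: list[Node] = field(default_factory=list)


def align(src: Node, tgt: Node, dictionary: set) -> tuple:
    """Score two subtrees bottom-up; return (score, aligned head pairs).

    Assumed scoring: 1 if the heads are a dictionary pair, plus the scores
    of a greedy one-to-one pairing of the children.
    """
    lexical = 1 if (src.head, tgt.head) in dictionary else 0
    # Score every child pairing first, so parents build on their children.
    candidates = []
    for i, s in enumerate(src.children):
        for j, t in enumerate(tgt.children):
            child_score, child_pairs = align(s, t, dictionary)
            candidates.append((child_score, i, j, child_pairs))
    candidates.sort(key=lambda c: -c[0])  # best-scoring pairings first

    used_src, used_tgt = set(), set()
    total, aligned = lexical, [(src.head, tgt.head)]
    for child_score, i, j, child_pairs in candidates:
        if child_score > 0 and i not in used_src and j not in used_tgt:
            used_src.add(i)
            used_tgt.add(j)
            total += child_score
            aligned += child_pairs
    return total, aligned


def iterate(corpus, seed, passes=3):
    """Re-align the corpus, adding newly aligned head pairs after each pass."""
    dictionary = set(seed)
    for _ in range(passes):
        new_pairs = set()
        for src, tgt in corpus:
            score, pairs = align(src, tgt, dictionary)
            if score > 0:  # keep only alignments with some lexical support
                new_pairs.update(p for p in pairs if p not in dictionary)
        if not new_pairs:  # nothing learned in this pass, so stop early
            break
        dictionary |= new_pairs
    return dictionary


# Toy corpus (invented for this sketch): two sentence pairs as lemma trees.
corpus = [
    (Node("comer", [Node("Juan"), Node("manzana")]),
     Node("eat", [Node("John"), Node("apple")])),
    (Node("querer", [Node("comer")]),
     Node("want", [Node("eat")])),
]
seed = {("Juan", "John"), ("manzana", "apple")}
learned = iterate(corpus, seed)
```

In this toy run the first pass can only propose ("comer", "eat"), because the second sentence pair has no dictionary support yet. The second pass then uses that new entry to align the second pair and propose ("querer", "want"), which is the kind of gain from repeated passes that the abstract refers to.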
Anthology ID:
1994.vlc-1.5
Volume:
Second Workshop on Very Large Corpora
Year:
1994
Venue:
VLC
Pages:
57–68
URL:
https://aclanthology.org/1994.vlc-1.5
Cite (ACL):
Ralph Grishman. 1994. Iterative Alignment of Syntactic Structures for a Bilingual Corpus. In Second Workshop on Very Large Corpora, pages 57–68.
Cite (Informal):
Iterative Alignment of Syntactic Structures for a Bilingual Corpus (Grishman, VLC 1994)
PDF:
https://preview.aclanthology.org/corrections-2024-07/1994.vlc-1.5.pdf