Dan2eng: wide-coverage Danish-English machine translation

Eckhard Bick


Abstract
The paper presents and evaluates a wide coverage, rule-governed machine translation system for Danish-English. Analysis and polysemy resolution are based on Constraint Grammar dependency trees. In its 85.000 lexeme lexicon, Dan2eng uses context-sensitive lexical transfer rules linking dependencies to semantic prototype conditions, syntactic function, definiteness etc. Dependency is further exploited instead of constituent bracketing to support syntactic movement rules. A robust derivational and compound analysis, as well as a separate NER module permit the handling of unrestricted text from a wide range of genres. The system averaged TER scores of 7 (BLEU 0.55-0.6) on student tasks, but performance varied widely against raw and edited Europarl references, respectively.
Anthology ID:
2007.mtsummit-papers.6
Volume:
Proceedings of Machine Translation Summit XI: Papers
Month:
September 10-14
Year:
2007
Address:
Copenhagen, Denmark
Editor:
Bente Maegaard
Venue:
MTSummit
SIG:
Publisher:
Note:
Pages:
Language:
URL:
https://preview.aclanthology.org/software-overview/2007.mtsummit-papers.6/
DOI:
Bibkey:
Cite (ACL):
Eckhard Bick. 2007. Dan2eng: wide-coverage Danish-English machine translation. In Proceedings of Machine Translation Summit XI: Papers, Copenhagen, Denmark.
Cite (Informal):
Dan2eng: wide-coverage Danish-English machine translation (Bick, MTSummit 2007)
Copy Citation:
PDF:
https://preview.aclanthology.org/software-overview/2007.mtsummit-papers.6.pdf