Comparative evaluation of the linguistic output of MT systems for translation and information purposes

Elia Yuste-Rodrigo, Francine Braun-Chen


Abstract
This paper describes a Machine Translation (MT) evaluation experiment where emphasis is placed on the quality of output and the extent to which it is geared to different users' needs. Adopting a very specific scenario, that of a multilingual international organisation, a clear distinction is made between two user classes: translators and administrators. Whereas the first group requires MT output to be accurate and of good post-editable quality in order to produce a polished translation, the second group primarily needs informative data for carrying out other, non-linguistic tasks, and therefore uses MT more as an information-gathering and gisting tool. During the experiment, MT output of three different systems is compared in order to establish which MT system best serves the organisation's multilingual communication and information needs. This is a comparative usability- and adequacy-oriented evaluation in that it attempts to help such organisations decide which system produces the most adequate output for certain well-defined user types. To perform the experiment, criteria relating to both users and MT output are examined with reference to the ISLE taxonomy. The experiment comprises two evaluation phases, the first at sentence level, the second at overall text level. In both phases, evaluators make use of a 1-5 rating scale. Weighted results provide some insight into the systems' usability and adequacy for the purposes described above. As a conclusion, it is suggested that further research should be devoted to the most critical aspect of this exercise, namely defining meaningful and useful criteria for evaluating the post-editability and informativeness of MT output.
Anthology ID:
2001.mtsummit-eval.12
Volume:
Workshop on MT Evaluation
Month:
September 18-22
Year:
2001
Address:
Santiago de Compostela, Spain
Editors:
Eduard Hovy, Margaret King, Sandra Manzi, Florence Reeder
Venue:
MTSummit
SIG:
Publisher:
Note:
Pages:
Language:
URL:
https://aclanthology.org/2001.mtsummit-eval.12
DOI:
Bibkey:
Cite (ACL):
Elia Yuste-Rodrigo and Francine Braun-Chen. 2001. Comparative evaluation of the linguistic output of MT systems for translation and information purposes. In Workshop on MT Evaluation, Santiago de Compostela, Spain.
Cite (Informal):
Comparative evaluation of the linguistic output of MT systems for translation and information purposes (Yuste-Rodrigo & Braun-Chen, MTSummit 2001)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-4/2001.mtsummit-eval.12.pdf