MT evaluation

Margaret King, Eduard Hovy, Benjamin K. Tsou, John White, Yusoff Zaharin


Abstract
This panel deals with the general topic of evaluation of machine translation systems. The first contribution sets out some recent work on creating standards for the design of evaluations. The second, by Eduard Hovy. takes up the particular issue of how metrics can be differentiated and systematized. Benjamin K. T'sou suggests that whilst men may evaluate machines, machines may also evaluate men. John S. White focuses on the question of the role of the user in evaluation design, and Yusoff Zaharin points out that circumstances and settings may have a major influence on evaluation design.
Anthology ID:
1999.mtsummit-1.31
Volume:
Proceedings of Machine Translation Summit VII
Month:
September 13-17
Year:
1999
Address:
Singapore, Singapore
Venue:
MTSummit
SIG:
Publisher:
Note:
Pages:
197–207
Language:
URL:
https://aclanthology.org/1999.mtsummit-1.31
DOI:
Bibkey:
Cite (ACL):
Margaret King, Eduard Hovy, Benjamin K. Tsou, John White, and Yusoff Zaharin. 1999. MT evaluation. In Proceedings of Machine Translation Summit VII, pages 197–207, Singapore, Singapore.
Cite (Informal):
MT evaluation (King et al., MTSummit 1999)
Copy Citation:
PDF:
https://preview.aclanthology.org/add_acl24_videos/1999.mtsummit-1.31.pdf