EvalD Reference-Less Discourse Evaluation for WMT18
Ondřej Bojar, Jiří Mírovský, Kateřina Rysová, Magdaléna Rysová
Abstract
We present the results of automatic evaluation of discourse in machine translation (MT) outputs using the EVALD tool. EVALD was originally designed and trained to assess the quality of human writing, for native speakers and foreign-language learners. MT has seen a tremendous leap in translation quality at the level of sentences and it is thus interesting to see if the human-level evaluation is becoming relevant.- Anthology ID:
- W18-6432
- Volume:
- Proceedings of the Third Conference on Machine Translation: Shared Task Papers
- Month:
- October
- Year:
- 2018
- Address:
- Belgium, Brussels
- Editors:
- Ondřej Bojar, Rajen Chatterjee, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Philipp Koehn, Christof Monz, Matteo Negri, Aurélie Névéol, Mariana Neves, Matt Post, Lucia Specia, Marco Turchi, Karin Verspoor
- Venue:
- WMT
- SIG:
- SIGMT
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 541–545
- Language:
- URL:
- https://preview.aclanthology.org/icon-24-ingestion/W18-6432/
- DOI:
- 10.18653/v1/W18-6432
- Cite (ACL):
- Ondřej Bojar, Jiří Mírovský, Kateřina Rysová, and Magdaléna Rysová. 2018. EvalD Reference-Less Discourse Evaluation for WMT18. In Proceedings of the Third Conference on Machine Translation: Shared Task Papers, pages 541–545, Belgium, Brussels. Association for Computational Linguistics.
- Cite (Informal):
- EvalD Reference-Less Discourse Evaluation for WMT18 (Bojar et al., WMT 2018)
- PDF:
- https://preview.aclanthology.org/icon-24-ingestion/W18-6432.pdf