Abstract
This paper presents the submissions of IOL Research in WMT 2023 quality estimation shared task. We participate in task 1 Quality Estimation on both sentence and word levels, which predicts sentence quality score and word quality tags. Our system is a cross-lingual and multitask model for both sentence and word levels. We utilize several multilingual Pretrained Language Models (PLMs) as backbones and build task modules on them to achieve better predictions. A regression module on PLM is used to predict sentence level score and word tagging layer is used to classify the tag of each word in the translation based on the encoded representations from PLM. Each PLM is pretrained on quality estimation and metrics data from the previous WMT tasks before finetuning on training data this year. Furthermore, we integrate predictions from different models for better performance while the weights of each model are automatically searched and optimized by performance on Dev set. Our method achieves competitive results.- Anthology ID:
- 2023.wmt-1.76
- Volume:
- Proceedings of the Eighth Conference on Machine Translation
- Month:
- December
- Year:
- 2023
- Address:
- Singapore
- Editors:
- Philipp Koehn, Barry Haddow, Tom Kocmi, Christof Monz
- Venue:
- WMT
- SIG:
- SIGMT
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 863–871
- Language:
- URL:
- https://aclanthology.org/2023.wmt-1.76
- DOI:
- 10.18653/v1/2023.wmt-1.76
- Cite (ACL):
- Zeyu Yan. 2023. IOL Research’s Submission for WMT 2023 Quality Estimation Shared Task. In Proceedings of the Eighth Conference on Machine Translation, pages 863–871, Singapore. Association for Computational Linguistics.
- Cite (Informal):
- IOL Research’s Submission for WMT 2023 Quality Estimation Shared Task (Yan, WMT 2023)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-4/2023.wmt-1.76.pdf