Dictionary-Aided Translation for Handling Multi-Word Expressions in Low-Resource Languages
Antonios Dimakis, Stella Markantonatou, Antonios Anastasopoulos
Abstract
Multi-word expressions (MWEs) present unique challenges in natural language processing (NLP), particularly within the context of translation systems, due to their inherent scarcity, non-compositional nature, and other distinct lexical and morphosyntactic characteristics, issues that are exacerbated in low-resource settings.In this study, we elucidate and attempt to address these challenges by leveraging a substantial corpus of human-annotated Greek MWEs. To address the complexity of translating such phrases, we propose a novel method leveraging an available out-of-context lexicon.We assess the translation capabilities of current state-of-the-art systems on this task, employing both automated metrics and human evaluators.We find that by using our method when applicable, the performance of current systems can be significantly improved, however these models are still unable to produce translations comparable to those of a human speaker.- Anthology ID:
- 2024.findings-acl.152
- Volume:
- Findings of the Association for Computational Linguistics ACL 2024
- Month:
- August
- Year:
- 2024
- Address:
- Bangkok, Thailand and virtual meeting
- Editors:
- Lun-Wei Ku, Andre Martins, Vivek Srikumar
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 2588–2595
- Language:
- URL:
- https://aclanthology.org/2024.findings-acl.152
- DOI:
- Cite (ACL):
- Antonios Dimakis, Stella Markantonatou, and Antonios Anastasopoulos. 2024. Dictionary-Aided Translation for Handling Multi-Word Expressions in Low-Resource Languages. In Findings of the Association for Computational Linguistics ACL 2024, pages 2588–2595, Bangkok, Thailand and virtual meeting. Association for Computational Linguistics.
- Cite (Informal):
- Dictionary-Aided Translation for Handling Multi-Word Expressions in Low-Resource Languages (Dimakis et al., Findings 2024)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-4/2024.findings-acl.152.pdf