2024
pdf
bib
abs
Generating subject-matter expertise assessment questions with GPT-4: a medical translation use-case
Diana Silveira
|
Marina Sánchez-Torrón
|
Helena Moniz
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1)
This paper examines the suitability of a large language model (LLM), GPT-4, for generating multiple choice questions (MCQs) aimed at assessing subject matter expertise (SME) in the domain of medical translation. The main objective of these questions is to model the skills of potential subject matter experts in a human-in-the-loop machine translation (MT) flow, to ensure that tasks are matched to the individuals with the right skill profile. The investigation was conducted at Unbabel, an artificial intelligence-powered human translation platform. Two medical translation experts evaluated the GPT-4-generated questions and answers, one focusing on English–European Portuguese, and the other on English–German. We present a methodology for creating prompts to elicit high-quality GPT-4 outputs for this use case, as well as for designing evaluation scorecards for human review of such output. Our findings suggest that GPT-4 has the potential to generate suitable items for subject matter expertise tests, providing a more efficient approach compared to relying solely on humans. Furthermore, we propose recommendations for future research to build on our approach and refine the quality of the outputs generated by LLMs.
2022
pdf
bib
abs
Unsupervised Machine Translation in Real-World Scenarios
Ona de Gibert
|
Iakes Goenaga
|
Jordi Armengol-Estapé
|
Olatz Perez-de-Viñaspre
|
Carla Parra
|
Marina Sánchez-Torrón
|
Marcis Pinnis
|
Gorka Labaka
|
Maite Melero
Proceedings of the Thirteenth Language Resources and Evaluation Conference
In this work, we present the work that has been carried on in the MT4All CEF project and the resources that it has generated by leveraging recent research carried out in the field of unsupervised learning. In the course of the project 18 monolingual corpora for specific domains and languages have been collected, and 12 bilingual dictionaries and translation models have been generated. As part of the research, the unsupervised MT methodology based only on monolingual corpora (Artetxe et al., 2017) has been tested on a variety of languages and domains. Results show that in specialised domains, when there is enough monolingual in-domain data, unsupervised results are comparable to those of general domain supervised translation, and that, at any rate, unsupervised techniques can be used to boost results whenever very little data is available.
2016
pdf
bib
abs
Machine Translation Quality and Post-Editor Productivity
Marina Sanchez-Torron
|
Philipp Koehn
Conferences of the Association for Machine Translation in the Americas: MT Researchers' Track
We assessed how different machine translation (MT) systems affect the post-editing (PE) process and product of professional English–Spanish translators. Our model found that for each 1-point increase in BLEU, there is a PE time decrease of 0.16 seconds per word, about 3-4%. The MT system with the lowest BLEU score produced the output that was post-edited to the lowest quality and with the highest PE effort, measured both in HTER and actual PE operations.