Silvia Bernardini


2019

MAGMATic: A Multi-domain Academic Gold Standard with Manual Annotation of Terminology for Machine Translation Evaluation
Randy Scansani | Luisa Bentivogli | Silvia Bernardini | Adriano Ferraresi
Proceedings of Machine Translation Summit XVII: Research Track

Do translator trainees trust machine translation? An experiment on post-editing and revision
Randy Scansani | Silvia Bernardini | Adriano Ferraresi | Luisa Bentivogli
Proceedings of Machine Translation Summit XVII: Translator, Project and User Tracks

2017

Enhancing Machine Translation of Academic Course Catalogues with Terminological Resources
Randy Scansani | Silvia Bernardini | Adriano Ferraresi | Federico Gaspari | Marcello Soffritti
Proceedings of the Workshop Human-Informed Translation and Interpreting Technology

This paper describes an approach to translating course unit descriptions from Italian and German into English, using a phrase-based machine translation (MT) system. The genre is very prominent among those requiring translation by universities in European countries in which English is a non-native language. For each language combination, an in-domain bilingual corpus including course unit and degree program descriptions is used to train an MT engine, whose output is then compared to that of a baseline engine trained on the Europarl corpus. In a subsequent experiment, a bilingual terminology database is added to the training sets of both engines and its impact on the output quality is evaluated based on BLEU and post-editing score. Results suggest that the use of domain-specific corpora boosts the engines' quality for both language combinations, especially for German-English, whereas adding terminological resources does not seem to bring notable benefits.

2009

Proceedings of the Workshop on Natural Language Processing Methods and Corpora in Translation, Lexicography, and Language Learning
Iustina Ilisei | Viktor Pekar | Silvia Bernardini
Proceedings of the Workshop on Natural Language Processing Methods and Corpora in Translation, Lexicography, and Language Learning

2004

Introducing the La Repubblica Corpus: A Large, Annotated, TEI(XML)-compliant Corpus of Newspaper Italian
Marco Baroni | Silvia Bernardini | Federica Comastri | Lorenzo Piccioni | Alessandra Volpi | Guy Aston | Marco Mazzoleni
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

BootCaT: Bootstrapping Corpora and Terms from the Web
Marco Baroni | Silvia Bernardini
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)