2024
pdf
Psychoanalytic Studies in the Digital Humanities: Employing Topic Modeling with an LLM to Decode Dreams During the Brazilian Pandemic
João Pedro Campos
|
Natalia Resende
|
Ricardo de Souza
|
Gilson Iannini
Proceedings of the 16th International Conference on Computational Processing of Portuguese - Vol. 2
2022
pdf
abs
MT-Pese: Machine Translation and Post-Editese
Sheila Castilho
|
Natália Resende
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation
This paper introduces the MT-Pese project, which aims at researching the post-editese phenomena in machine translated texts. We describe a range of experiments performed in order to gauge the effect of post-editese in dif-ferent domains, backtranslation, and quality.
pdf
abs
MTrill: Machine Translation Impact on Language Learning
Natalia Resende
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation
This paper presents the MTrill project which aimed at investigating the impact of popular web-based machine transla-tion (MT) tools on the cognitive pro-cessing of English as a second language. The methodological approach and main results are presented.
pdf
abs
Achievements of the PRINCIPLE Project: Promoting MT for Croatian, Icelandic, Irish and Norwegian
Petra Bago
|
Sheila Castilho
|
Jane Dunne
|
Federico Gaspari
|
Andre K
|
Gauti Kristmannsson
|
Jon Arild Olsen
|
Natalia Resende
|
Níels Rúnar Gíslason
|
Dana D. Sheridan
|
Páraic Sheridan
|
John Tinsley
|
Andy Way
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation
This paper provides an overview of the main achievements of the completed PRINCIPLE project, a 2-year action funded by the European Commission under the Connecting Europe Facility (CEF) programme. PRINCIPLE focused on collecting high-quality language resources for Croatian, Icelandic, Irish and Norwegian, which are severely low-resource languages, especially for building effective machine translation (MT) systems. We report the achievements of the project, primarily, in terms of the large amounts of data collected for all four low-resource languages and of promoting the uptake of neural MT (NMT) for these languages.
pdf
abs
Overview of the ELE Project
Itziar Aldabe
|
Jane Dunne
|
Aritz Farwell
|
Owen Gallagher
|
Federico Gaspari
|
Maria Giagkou
|
Jan Hajic
|
Jens Peter Kückens
|
Teresa Lynn
|
Georg Rehm
|
German Rigau
|
Katrin Marheinecke
|
Stelios Piperidis
|
Natalia Resende
|
Tea Vojtěchová
|
Andy Way
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation
This paper provides an overview of the ongoing European Language Equality(ELE) project, an 18-month action funded by the European Commission which involves 52 partners. The primary goal of ELE is to prepare the European Language Equality Programme, in the form of a strategic research, innovation and implementation agenda and a roadmap for achieving full digital language equality (DLE) in Europe by 2030.
2021
abs
Building MT systems in low resourced languages for Public Sector users in Croatia, Iceland, Ireland, and Norway
Róisín Moran
|
Carla Para Escartín
|
Akshai Ramesh
|
Páraic Sheridan
|
Jane Dunne
|
Federico Gaspari
|
Sheila Castilho
|
Natalia Resende
|
Andy Way
Proceedings of Machine Translation Summit XVIII: Users and Providers Track
When developing Machine Translation engines, low resourced language pairs tend to be in a disadvantaged position: less available data means that developing robust MT models can be more challenging. The EU-funded PRINCIPLE project aims at overcoming this challenge for four low resourced European languages: Norwegian, Croatian, Irish and Icelandic. This presentation will give an overview of the project, with a focus on the set of Public Sector users and their use cases for which we have developed MT solutions. We will discuss the range of language resources that have been gathered through contributions from public sector collaborators, and present the extensive evaluations that have been undertaken, including significant user evaluation of MT systems across all of the public sector participants in each of the four countries involved.
2020
pdf
abs
MT syntactic priming effects on L2 English speakers
Natália Resende
|
Benjamin Cowan
|
Andy Way
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation
In this paper, we tested 20 Brazilian Portuguese speakers at intermediate and advanced English proficiency levels to investigate the influence of Google Translate’s MT system on the mental processing of English as a second language. To this end, we employed a syntactic priming experimental paradigm using a pretest-priming design which allowed us to compare participants’ linguistic behaviour before and after a translation task using Google Translate. Results show that, after performing a translation task with Google Translate, participants more frequently described images in English using the syntactic alternative previously seen in the output of Google Translate, compared to the translation task with no prior influence of the MT output. Results also show that this syntactic priming effect is modulated by English proficiency levels.
pdf
abs
MTrill project: Machine Translation impact on language learning
Natália Resende
|
Andy Way
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation
Over the last decades, massive research investments have been made in the development of machine translation (MT) systems (Gupta and Dhawan, 2019). This has brought about a paradigm shift in the performance of these language tools, leading to widespread use of popular MT systems (Gaspari and Hutchins, 2007). Although the first MT engines were used for gisting purposes, in recent years, there has been an increasing interest in using MT tools, especially the freely available online MT tools, for language teaching and learning (Clifford et al., 2013). The literature on MT and Computer Assisted Language Learning (CALL) shows that, over the years, MT systems have been facilitating language teaching and also language learning (Nin ̃o, 2006). It has been shown that MT tools can increase awareness of grammatical linguistic features of a foreign language. Research also shows the positive role of MT systems in the development of writing skills in English as well as in improving communication skills in English(Garcia and Pena, 2011). However, to date, the cognitive impact of MT on language acquisition and on the syntactic aspects of language processing has not yet been investigated and deserves further scrutiny. The MTril project aims at filling this gap in the literature by examining whether MT is contributing to a central aspect of language acquisition: the so-called language binding, i.e., the ability to combine single words properly in a grammatical sentence (Heyselaar et al., 2017; Ferreira and Bock, 2006). The project focus on the initial stages (pre-intermediate and intermediate) of the acquisition of English syntax by Brazilian Portuguese native speakers using MT systems as a support for language learning.
2019
pdf
Large-scale Machine Translation Evaluation of the iADAATPA Project
Sheila Castilho
|
Natália Resende
|
Federico Gaspari
|
Andy Way
|
Tony O’Dowd
|
Marek Mazur
|
Manuel Herranz
|
Alex Helle
|
Gema Ramírez-Sánchez
|
Víctor Sánchez-Cartagena
|
Mārcis Pinnis
|
Valters Šics
Proceedings of Machine Translation Summit XVII: Translator, Project and User Tracks
pdf
abs
What Influences the Features of Post-editese? A Preliminary Study
Sheila Castilho
|
Natália Resende
|
Ruslan Mitkov
Proceedings of the Human-Informed Translation and Interpreting Technology Workshop (HiT-IT 2019)
While a number of studies have shown evidence of translationese phenomena, that is, statistical differences between original texts and translated texts (Gellerstam, 1986), results of studies searching for translationese features in postedited texts (what has been called ”posteditese” (Daems et al., 2017)) have presented mixed results. This paper reports a preliminary study aimed at identifying the presence of post-editese features in machine-translated post-edited texts and at understanding how they differ from translationese features. We test the influence of factors such as post-editing (PE) levels (full vs. light), translation proficiency (professionals vs. students) and text domain (news vs. literary). Results show evidence of post-editese features, especially in light PE texts and in certain domains.