Elisabeth Maier
Also published as: Elisabeth Mager
2026
Findings of the AmericasNLP 2026 Shared Task on Cultural Image Captioning for Indigenous Languages
Minh Duc Bui | David Guzmán | Abteen Ebrahimi | Franklin Morales | Marvin Agüero-Torales | Raquel Insfrán | Cecilia González | Ramón Araujo | Luca Cernuzzi | Carlos Raul Noh Chi | Carlos Eduardo Tec Cahun | Sindi Estrella Poot Cohuo | Daniel Ricardo Benítez Chi | Santos Natanael Palomo Arévalo | Jessica Elizabeth Canul Canche | Deysi Aracely Poot Poot | Wendy Marleny Dzib Dzib | Eduardo José Ake Pool | Reynaldo Alexander Couoh Martin | Silvia Fernandez Sabido | Luis Samuel Santiago Melchor | Sotero Silverio | Robert Pugh | Raúl Vázquez | John E. Ortega | Arturo Oncevay | Rubén Manrique | Luis Chiruzzo | Rolando Coto-Solano | Elisabeth Mager | Shruti Rijhwani | David Ifeoluwa Adelani | Manuel Mager | Katharina von der Wense
Proceedings of the Sixth Workshop on NLP for Indigenous Languages of the Americas (AmericasNLP)
Minh Duc Bui | David Guzmán | Abteen Ebrahimi | Franklin Morales | Marvin Agüero-Torales | Raquel Insfrán | Cecilia González | Ramón Araujo | Luca Cernuzzi | Carlos Raul Noh Chi | Carlos Eduardo Tec Cahun | Sindi Estrella Poot Cohuo | Daniel Ricardo Benítez Chi | Santos Natanael Palomo Arévalo | Jessica Elizabeth Canul Canche | Deysi Aracely Poot Poot | Wendy Marleny Dzib Dzib | Eduardo José Ake Pool | Reynaldo Alexander Couoh Martin | Silvia Fernandez Sabido | Luis Samuel Santiago Melchor | Sotero Silverio | Robert Pugh | Raúl Vázquez | John E. Ortega | Arturo Oncevay | Rubén Manrique | Luis Chiruzzo | Rolando Coto-Solano | Elisabeth Mager | Shruti Rijhwani | David Ifeoluwa Adelani | Manuel Mager | Katharina von der Wense
Proceedings of the Sixth Workshop on NLP for Indigenous Languages of the Americas (AmericasNLP)
Indigenous languages of the Americas face severe endangerment, and the scarcity of culturally grounded resources remains a critical barrier to revitalization efforts. We present the AmericasNLP 2026 Shared Task on Cultural Image Captioning for Indigenous Languages, the first shared task dedicated to generating captions for images depicting Indigenous cultures of the Americas, written in the Indigenous languages themselves. To support this, we introduce and publicly release a newly constructed dataset spanning five cultures and their dominant languages: Bribri, Guaraní, Yucatec Maya, Central Veracruz Nahuatl, and Wixárika. Evaluation follows a two-stage process, combining automatic evaluation using ChrF++ with human evaluation of the top-performing systems for each language. Eight teams participate, submitting 27 systems in total. Results indicate that the task remains largely unsolved: while the strongest systems produce understandable captions, they fall short on descriptive detail and, critically, cultural grounding.
2024
Proceedings of the 9th edition of the Swiss Text Analytics Conference
Corsin Capol | Mark Cieliebak | Albert Weichselbraun | Claudiu Musat | Elisabeth Maier | Lucas Zimmermann
Proceedings of the 9th edition of the Swiss Text Analytics Conference
Corsin Capol | Mark Cieliebak | Albert Weichselbraun | Claudiu Musat | Elisabeth Maier | Lucas Zimmermann
Proceedings of the 9th edition of the Swiss Text Analytics Conference
2023
Ethical Considerations for Machine Translation of Indigenous Languages: Giving a Voice to the Speakers
Manuel Mager | Elisabeth Mager | Katharina Kann | Ngoc Thang Vu
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Manuel Mager | Elisabeth Mager | Katharina Kann | Ngoc Thang Vu
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
In recent years machine translation has become very successful for high-resource language pairs. This has also sparked new interest in research on the automatic translation of low-resource languages, including Indigenous languages. However, the latter are deeply related to the ethnic and cultural groups that speak (or used to speak) them. The data collection, modeling and deploying machine translation systems thus result in new ethical questions that must be addressed. Motivated by this, we first survey the existing literature on ethical considerations for the documentation, translation, and general natural language processing for Indigenous languages. Afterward, we conduct and analyze an interview study to shed light on the positions of community leaders, teachers, and language activists regarding ethical concerns for the automatic translation of their languages. Our results show that the inclusion, at different degrees, of native speakers and community members is vital to performing better and more ethical research on Indigenous languages.
2022
BPE vs. Morphological Segmentation: A Case Study on Machine Translation of Four Polysynthetic Languages
Manuel Mager | Arturo Oncevay | Elisabeth Mager | Katharina Kann | Thang Vu
Findings of the Association for Computational Linguistics: ACL 2022
Manuel Mager | Arturo Oncevay | Elisabeth Mager | Katharina Kann | Thang Vu
Findings of the Association for Computational Linguistics: ACL 2022
Morphologically-rich polysynthetic languages present a challenge for NLP systems due to data sparsity, and a common strategy to handle this issue is to apply subword segmentation. We investigate a wide variety of supervised and unsupervised morphological segmentation methods for four polysynthetic languages: Nahuatl, Raramuri, Shipibo-Konibo, and Wixarika. Then, we compare the morphologically inspired segmentation methods against Byte-Pair Encodings (BPEs) as inputs for machine translation (MT) when translating to and from Spanish. We show that for all language pairs except for Nahuatl, an unsupervised morphological segmentation algorithm outperforms BPEs consistently and that, although supervised methods achieve better segmentation scores, they under-perform in MT challenges. Finally, we contribute two new morphological segmentation datasets for Raramuri and Shipibo-Konibo, and a parallel corpus for Raramuri–Spanish.
AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages
Abteen Ebrahimi | Manuel Mager | Arturo Oncevay | Vishrav Chaudhary | Luis Chiruzzo | Angela Fan | John Ortega | Ricardo Ramos | Annette Rios | Ivan Vladimir Meza Ruiz | Gustavo Giménez-Lugo | Elisabeth Mager | Graham Neubig | Alexis Palmer | Rolando Coto-Solano | Thang Vu | Katharina Kann
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Abteen Ebrahimi | Manuel Mager | Arturo Oncevay | Vishrav Chaudhary | Luis Chiruzzo | Angela Fan | John Ortega | Ricardo Ramos | Annette Rios | Ivan Vladimir Meza Ruiz | Gustavo Giménez-Lugo | Elisabeth Mager | Graham Neubig | Alexis Palmer | Rolando Coto-Solano | Thang Vu | Katharina Kann
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Pretrained multilingual models are able to perform cross-lingual transfer in a zero-shot setting, even for languages unseen during pretraining. However, prior work evaluating performance on unseen languages has largely been limited to low-level, syntactic tasks, and it remains unclear if zero-shot learning of high-level, semantic tasks is possible for unseen languages. To explore this question, we present AmericasNLI, an extension of XNLI (Conneau et al., 2018) to 10 Indigenous languages of the Americas. We conduct experiments with XLM-R, testing multiple zero-shot and translation-based approaches. Additionally, we explore model adaptation via continued pretraining and provide an analysis of the dataset by considering hypothesis-only models. We find that XLM-R’s zero-shot performance is poor for all 10 languages, with an average performance of 38.48%. Continued pretraining offers improvements, with an average accuracy of 43.85%. Surprisingly, training on poorly translated data by far outperforms all other methods with an accuracy of 49.12%.
2018
Lost in Translation: Analysis of Information Loss During Machine Translation Between Polysynthetic and Fusional Languages
Manuel Mager | Elisabeth Mager | Alfonso Medina-Urrea | Ivan Vladimir Meza Ruiz | Katharina Kann
Proceedings of the Workshop on Computational Modeling of Polysynthetic Languages
Manuel Mager | Elisabeth Mager | Alfonso Medina-Urrea | Ivan Vladimir Meza Ruiz | Katharina Kann
Proceedings of the Workshop on Computational Modeling of Polysynthetic Languages
Machine translation from polysynthetic to fusional languages is a challenging task, which gets further complicated by the limited amount of parallel text available. Thus, translation performance is far from the state of the art for high-resource and more intensively studied language pairs. To shed light on the phenomena which hamper automatic translation to and from polysynthetic languages, we study translations from three low-resource, polysynthetic languages (Nahuatl, Wixarika and Yorem Nokki) into Spanish and vice versa. Doing so, we find that in a morpheme-to-morpheme alignment an important amount of information contained in polysynthetic morphemes has no Spanish counterpart, and its translation is often omitted. We further conduct a qualitative analysis and, thus, identify morpheme types that are commonly hard to align or ignored in the translation process.
2004
How to Disassemble Alphabetical Processions - Morphological Treatment of Unknown Words
Stephan Bopp | Sandro Pedrazzini | Elisabeth Maier
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)
Stephan Bopp | Sandro Pedrazzini | Elisabeth Maier
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)
2003
Scalability in MT systems
Elisabeth Maier | Anthony Clarke
Proceedings of Machine Translation Summit IX: Papers
Elisabeth Maier | Anthony Clarke
Proceedings of Machine Translation Summit IX: Papers
In this paper we show why scalability is one of the most important aspects for the evaluation of Machine Translation (MT) systems and what scalability entails in the framework of MT. We illustrate the issue of scalability by reporting about an MT solution, which has been chosen in the course of a thorough hands-on evaluation and which in the meantime has been developed from a pilot system to a MT turnkey solution for mid-to large-scale enterprises.
2002
A report on the experiences of implementing an MT system for use in a commercial environment
Anthony Clarke | Elisabeth Maier | Hans-Udo Stadler
Proceedings of the 5th Conference of the Association for Machine Translation in the Americas: User Studies
Anthony Clarke | Elisabeth Maier | Hans-Udo Stadler
Proceedings of the 5th Conference of the Association for Machine Translation in the Americas: User Studies
This paper describes the process of implementing a machine translation system (MT system) and the problems and pitfalls encountered within this process at CLS Corporate Language Services AG, a language solutions provider for the Swiss financial services industry, in particular UBS AG and Zurich Financial Services. The implementation was based on the perceived requirements of large organizations, which is why the focus was more on practical rather than academic aspects. The paper can be roughly divided into three parts: (1) definition of the implementation process, co-ordination and execution, (2) implementation plan and customer/user management, (3) monitoring of the MT system and related maintenance after going live.
2001
Evaluation of machine translation systems at CLS Corporate Language Services AG
Elisabeth Maier | Anthony Clarke | Hans-Udo Stadler
Proceedings of Machine Translation Summit VIII
Elisabeth Maier | Anthony Clarke | Hans-Udo Stadler
Proceedings of Machine Translation Summit VIII
This paper describes the evaluation of Machine Translation (MT) System for use in a large company. To take into account the specific requirements of such an environment, a pragmatic approach for the evaluation was developed. It consists of five steps ranging from a specification of the evaluation process to the integration of the chosen MT system in a given infrastructure. The process includes a specification of MT evaluation criteria relevant to systems which have to be employed for a large customer base. The paper also shows the results of such an evaluation study which was recently carried out at CLS Corporate Language Services AG, where COMPRENDIUM is in the meantime being employed as corporate MT system.
2000
Terms Specification and Extraction within a Linguistic-based Intranet Service
Sandro Pedrazzini | Elisabeth Maier | Dierk König
Proceedings of the Second International Conference on Language Resources and Evaluation (LREC’00)
Sandro Pedrazzini | Elisabeth Maier | Dierk König
Proceedings of the Second International Conference on Language Resources and Evaluation (LREC’00)
1997
Clarification Dialogues as Measure to Increase Robustness in a Spoken Dialogue System
Elisabeth Maier | Norbert Reithinger | Jan Alexandersson
Interactive Spoken Dialog Systems: Bringing Speech and NLP Together in Real Applications
Elisabeth Maier | Norbert Reithinger | Jan Alexandersson
Interactive Spoken Dialog Systems: Bringing Speech and NLP Together in Real Applications
Insights into the Dialogue Processing of VERBMOBIL
Jan Alexandersson | Norbert Reithinger | Elisabeth Maier
Fifth Conference on Applied Natural Language Processing
Jan Alexandersson | Norbert Reithinger | Elisabeth Maier
Fifth Conference on Applied Natural Language Processing
1995
Utilizing Statistical Dialogue Act Processing in Verbmobil
Norbert Reithinger | Elisabeth Maier
33rd Annual Meeting of the Association for Computational Linguistics
Norbert Reithinger | Elisabeth Maier
33rd Annual Meeting of the Association for Computational Linguistics
A Robust and Efficient Three-Layered Dialogue Component for a Speech-to-Speech Translation System
Jan Alexandersson | Elisabeth Maier | Norbert Reithinger
Seventh Conference of the European Chapter of the Association for Computational Linguistics
Jan Alexandersson | Elisabeth Maier | Norbert Reithinger
Seventh Conference of the European Chapter of the Association for Computational Linguistics
1994
Generating Cooperative System Responses in Information Retrieval Dialogues
Markus Fischer | Elisabeth Maier | Adelheit Stein
Proceedings of the Seventh International Workshop on Natural Language Generation
Markus Fischer | Elisabeth Maier | Adelheit Stein
Proceedings of the Seventh International Workshop on Natural Language Generation
1993
Search
Fix author
Co-authors
- Manuel Mager 5
- Katharina von der Wense 5
- Norbert Reithinger 4
- Jan Alexandersson 3
- Anthony Clarke 3
- Arturo Oncevay 3
- Luis Chiruzzo 2
- Rolando Coto-Solano 2
- Abteen Ebrahimi 2
- Ivan Meza-Ruiz 2
- Sandro Pedrazzini 2
- Hans-Udo Stadler 2
- Thang Vu 2
- David Ifeoluwa Adelani 1
- Marvin Agüero-Torales 1
- Eduardo José Ake Pool 1
- Ramón Araujo 1
- Daniel Ricardo Benítez Chi 1
- Stephan Bopp 1
- Minh Duc Bui 1
- Jessica Elizabeth Canul Canche 1
- Corsin Capol 1
- Luca Cernuzzi 1
- Vishrav Chaudhary 1
- Mark Cieliebak 1
- Reynaldo Alexander Couoh Martin 1
- Wendy Marleny Dzib Dzib 1
- Angela Fan 1
- Silvia Fernández Sabido 1
- Markus Fischer 1
- Gustavo Giménez-Lugo 1
- Cecilia González 1
- David Guzmán 1
- Raquel Insfrán 1
- Dierk König 1
- Rubén Manrique 1
- Alfonso Medina-Urrea 1
- Franklin Morales 1
- Claudiu Musat 1
- Graham Neubig 1
- Carlos Raul Noh Chi 1
- John Ortega 1
- John E. Ortega 1
- Alexis Palmer 1
- Santos Natanael Palomo Arévalo 1
- Sindi Estrella Poot Cohuo 1
- Deysi Aracely Poot Poot 1
- Robert Pugh 1
- Ricardo Ramos 1
- Shruti Rijhwani 1
- Annette Rios Gonzales 1
- Luis Samuel Santiago Melchor 1
- Sotero Silverio 1
- Adelheit Stein 1
- Carlos Eduardo Tec Cahun 1
- Ngoc Thang Vu 1
- Raúl Vázquez 1
- Albert Weichselbraun 1
- Lucas Zimmermann 1