David Guzmán
2026
MERLIN: Multi-Stage Curriculum Alignment for Multilingual Encoder-LLM Integration in Cross-Lingual Reasoning
Kosei Uemura | David Guzmán | Quang Phuoc Nguyen | Jesujoba Oluwadara Alabi | En-Shiun Annie Lee | David Ifeoluwa Adelani
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)
Large language models (LLMs) excel in English but still struggle with complex reasoning in many low-resource languages (LRLs). Existing methods, such as LangBridge and MindMerger, align LLMs with multilingual encoders and raise accuracy for mid- and high-resource languages, yet a large performance gap remains for LRLs. We present MERLIN, a model-stacking framework that refines the alignment iteratively in two stages following a curriculum strategy (from general to specific, where general is bilingual bitext and specific is task-specific data) and adapts only a small set of DoRA weights. On the AfriMGSM benchmark, MERLIN improves exact-match accuracy by +12.9 pp over MindMerger and outperforms GPT-4o-mini by 15.2 pp. It also yields consistent gains on MGSM and MSVAMP (+0.9 and +2.8 pp), demonstrating effectiveness across both low- and high-resource settings.
Findings of the AmericasNLP 2026 Shared Task on Cultural Image Captioning for Indigenous Languages
Minh Duc Bui | David Guzmán | Abteen Ebrahimi | Franklin Morales | Marvin Agüero-Torales | Raquel Insfrán | Cecilia González | Ramón Araujo | Luca Cernuzzi | Carlos Raul Noh Chi | Carlos Eduardo Tec Cahun | Sindi Estrella Poot Cohuo | Daniel Ricardo Benítez Chi | Santos Natanael Palomo Arévalo | Jessica Elizabeth Canul Canche | Deysi Aracely Poot Poot | Wendy Marleny Dzib Dzib | Eduardo José Ake Pool | Reynaldo Alexander Couoh Martin | Silvia Fernandez Sabido | Luis Samuel Santiago Melchor | Sotero Silverio | Robert Pugh | Raúl Vázquez | John E. Ortega | Arturo Oncevay | Rubén Manrique | Luis Chiruzzo | Rolando Coto-Solano | Elisabeth Mager | Shruti Rijhwani | David Ifeoluwa Adelani | Manuel Mager | Katharina von der Wense
Proceedings of the Sixth Workshop on NLP for Indigenous Languages of the Americas (AmericasNLP)
Indigenous languages of the Americas face severe endangerment, and the scarcity of culturally grounded resources remains a critical barrier to revitalization efforts. We present the AmericasNLP 2026 Shared Task on Cultural Image Captioning for Indigenous Languages, the first shared task dedicated to generating captions for images depicting Indigenous cultures of the Americas, written in the Indigenous languages themselves. To support this, we introduce and publicly release a newly constructed dataset spanning five cultures and their dominant languages: Bribri, Guaraní, Yucatec Maya, Central Veracruz Nahuatl, and Wixárika. Evaluation follows a two-stage process, combining automatic evaluation using ChrF++ with human evaluation of the top-performing systems for each language. Eight teams participate, submitting 27 systems in total. Results indicate that the task remains largely unsolved: while the strongest systems produce understandable captions, they fall short on descriptive detail and, critically, cultural grounding.
2025
AlignFreeze: Navigating the Impact of Realignment on the Layers of Multilingual Models Across Diverse Languages
Steve Bakos | David Guzmán | Riddhi More | Kelly Chutong Li | Félix Gaschi | En-Shiun Annie Lee
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers)
Realignment techniques are often employed to enhance cross-lingual transfer in multilingual language models. Still, they can sometimes degrade performance in languages that differ significantly from the fine-tuned source language. This paper introduces AlignFreeze, a method that freezes either the lower or the upper half of the model's layers during realignment. Through controlled experiments on 4 tasks, 3 models, and 35 languages, we find that realignment affects all the layers but can be the most detrimental to the lower ones. Freezing the lower layers can prevent performance degradation. In particular, AlignFreeze improves Part-of-Speech (PoS) tagging performance in languages where full realignment fails: with XLM-R, it provides improvements of more than one standard deviation in accuracy in seven more languages than full realignment.
2024
Unlocking Parameter-Efficient Fine-Tuning for Low-Resource Language Translation
Tong Su | Xin Peng | Sarubi Thillainathan | David Guzmán | Surangika Ranathunga | En-Shiun Lee
Findings of the Association for Computational Linguistics: NAACL 2024
Parameter-efficient fine-tuning (PEFT) methods are increasingly vital in adapting large-scale pre-trained language models for diverse tasks, offering a balance between adaptability and computational efficiency. They are important in Low-Resource Language (LRL) Neural Machine Translation (NMT) to enhance translation accuracy with minimal resources. However, their practical effectiveness varies significantly across different languages. We conducted comprehensive empirical experiments with varying LRL domains and sizes to evaluate the performance of 8 PEFT methods, with a total of 15 architectures, using the SacreBLEU score. We showed that 6 PEFT architectures outperform the baseline on both in-domain and out-of-domain tests and that the Houlsby+Inversion adapter has the best performance overall, demonstrating the effectiveness of PEFT methods.
Co-authors
- David Ifeoluwa Adelani 2
- En-Shiun Annie Lee 2
- Marvin Agüero-Torales 1
- Eduardo José Ake Pool 1
- Jesujoba Alabi 1
- Ramón Araujo 1
- Steve Bakos 1
- Daniel Ricardo Benítez Chi 1
- Minh Duc Bui 1
- Jessica Elizabeth Canul Canche 1
- Luca Cernuzzi 1
- Luis Chiruzzo 1
- Rolando Coto-Solano 1
- Reynaldo Alexander Couoh Martin 1
- Wendy Marleny Dzib Dzib 1
- Abteen Ebrahimi 1
- Silvia Fernández Sabido 1
- Felix Gaschi 1
- Cecilia González 1
- Raquel Insfrán 1
- En-Shiun Lee 1
- Kelly Chutong Li 1
- Manuel Mager 1
- Elisabeth Maier 1
- Rubén Manrique 1
- Franklin Morales 1
- Riddhi More 1
- Quang Phuoc Nguyen 1
- Carlos Raul Noh Chi 1
- Arturo Oncevay 1
- John E. Ortega 1
- Santos Natanael Palomo Arévalo 1
- Xin Peng 1
- Sindi Estrella Poot Cohuo 1
- Deysi Aracely Poot Poot 1
- Robert Pugh 1
- Surangika Ranathunga 1
- Shruti Rijhwani 1
- Luis Samuel Santiago Melchor 1
- Sotero Silverio 1
- Tong Su 1
- Carlos Eduardo Tec Cahun 1
- Sarubi Thillainathan 1
- Kosei Uemura 1
- Raúl Vázquez 1
- Katharina von der Wense 1