Why Low-Resource NLP Needs More Than Cross-Lingual Transfer: Lessons Learned from Luxembourgish
Fred Philippy, Siwen Guo, Jacques Klein, Tegawendé F. Bissyandé
Abstract
Cross-lingual transfer has become a central paradigm for extending natural language processing (NLP) technologies to low-resource languages. By leveraging supervision from high-resource languages, multilingual language models can achieve strong task performance with little or no labeled target-language data. However, it remains unclear to what extent cross-lingual transfer can substitute for language-specific efforts. In this paper, we synthesize prior research findings and data collection results on Luxembourgish, which, despite its typological proximity to high-resource languages and its presence in a multilingual context, remains insufficiently represented in modern NLP technologies. Across findings, we observe a fundamental interdependence between cross-lingual transfer and language-specific efforts. Cross-lingual transfer can substantially improve target-language performance, but its success depends critically on the availability of sufficiently high-quality, task-aligned target-language data. At the same time, such resources, particularly in low-resource settings, are typically too limited in scale to drive strong performance on their own. Instead, such resources reach their full potential only when leveraged within a cross-lingual framework. We therefore argue that cross-lingual transfer and language-specific efforts should not be viewed as competing alternatives. Instead, they function as complementary components of a sustainable low-resource NLP pipeline. Based on these insights, we provide practical guidelines for integrating and balancing cross-lingual transfer with language-specific development in sustainable low-resource NLP pipelines.- Anthology ID:
- 2026.bigpicture-main.7
- Volume:
- Proceedings of The Big Picture v2: Crafting a Research Narrative
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, CA, USA
- Editors:
- Yanai Elazar, Allyson Ettinger, Nora Kassner, Sebastian Ruder
- Venues:
- BigPicture | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 82–93
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl-workshops/2026.bigpicture-main.7/
- DOI:
- Cite (ACL):
- Fred Philippy, Siwen Guo, Jacques Klein, and Tegawendé F. Bissyandé. 2026. Why Low-Resource NLP Needs More Than Cross-Lingual Transfer: Lessons Learned from Luxembourgish. In Proceedings of The Big Picture v2: Crafting a Research Narrative, pages 82–93, San Diego, CA, USA. Association for Computational Linguistics.
- Cite (Informal):
- Why Low-Resource NLP Needs More Than Cross-Lingual Transfer: Lessons Learned from Luxembourgish (Philippy et al., BigPicture 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl-workshops/2026.bigpicture-main.7.pdf