Grammatical Error Correction for Low-Resource Languages: The Case of Zarma
Mamadou K. Keita, Adwoa Bremang, Huy Le, Dennis Owusu, Marcos Zampieri, Christopher Homan
Abstract
Grammatical error correction (GEC) aims to improve text quality and readability. Previous work on the task focused primarily on high-resource languages, while low-resource languages lack robust tools. To address this shortcoming, we present a study on GEC for Zarma, a language spoken by over five million people in West Africa. We compare three approaches: rule-based methods, machine translation (MT) models, and large language models (LLMs). We evaluated GEC models using a dataset of more than 250,000 examples, including synthetic and human-annotated data. Our results showed that the MT-based approach using M2M100 outperforms others, with a detection rate of 95.82% and a suggestion accuracy of 78.90% in automatic evaluations (AE) and an average score of 3.0 out of 5.0 in manual evaluation (ME) from native speakers for grammar and logical corrections. The rule-based method was effective for spelling errors but failed on complex context-level errors. LLMs—Gemma 2b and MT5-small—showed moderate performance. Our work supports use of MT models to enhance GEC in low-resource settings, and we validated these results with Bambara, another West African language.- Anthology ID:
- 2026.loreslm-1.9
- Volume:
- Proceedings of the Second Workshop on Language Models for Low-Resource Languages (LoResLM 2026)
- Month:
- March
- Year:
- 2026
- Address:
- Rabat, Morocco
- Editors:
- Hansi Hettiarachchi, Tharindu Ranasinghe, Alistair Plum, Paul Rayson, Ruslan Mitkov, Mohamed Gaber, Damith Premasiri, Fiona Anting Tan, Lasitha Uyangodage
- Venue:
- LoResLM
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 98–109
- Language:
- URL:
- https://preview.aclanthology.org/missing-isa-paper/2026.loreslm-1.9/
- DOI:
- 10.18653/v1/2026.loreslm-1.9
- Cite (ACL):
- Mamadou K. Keita, Adwoa Bremang, Huy Le, Dennis Owusu, Marcos Zampieri, and Christopher Homan. 2026. Grammatical Error Correction for Low-Resource Languages: The Case of Zarma. In Proceedings of the Second Workshop on Language Models for Low-Resource Languages (LoResLM 2026), pages 98–109, Rabat, Morocco. Association for Computational Linguistics.
- Cite (Informal):
- Grammatical Error Correction for Low-Resource Languages: The Case of Zarma (Keita et al., LoResLM 2026)
- PDF:
- https://preview.aclanthology.org/missing-isa-paper/2026.loreslm-1.9.pdf