@inproceedings{essuman-buys-2025-story,
    title     = "Story Generation with Large Language Models for {African} Languages",
    author    = "Essuman, Catherine Nana Nyaah and
      Buys, Jan",
    editor    = "Lignos, Constantine and
      Abdulmumin, Idris and
      Adelani, David",
    booktitle = "Proceedings of the Sixth Workshop on African Natural Language Processing (AfricaNLP 2025)",
    month     = jul,
    year      = "2025",
    address   = "Vienna, Austria",
    publisher = "Association for Computational Linguistics",
    url       = "https://aclanthology.org/2025.africanlp-1.16/",
    pages     = "115--125",
    isbn      = "979-8-89176-257-2",
    abstract  = "The development of Large Language Models (LLMs) for African languages has been hindered by the lack of large-scale textual data. Previous research has shown that relatively small language models, when trained on synthetic data generated by larger models, can produce fluent, short English stories, providing a data-efficient alternative to large-scale pretraining. In this paper, we apply a similar approach to develop and evaluate small language models for generating childrens stories in isiZulu and Yoruba, using synthetic datasets created through translation and multilingual prompting. We train six language-specific models varying in dataset size and source, and based on the GPT-2 architecture. Our results show that models trained on synthetic low-resource data are capable of producing coherent and fluent short stories in isiZulu and Yoruba. Models trained on larger synthetic datasets generally perform better in terms of coherence and grammar, and also tend to generalize better, as seen by their lower evaluation perplexities. Models trained on datasets generated through prompting instead of translation generate similar or more coherent stories and display more creativity, but perform worse in terms of generalization to unseen data. In addition to the potential educational applications of the automated story generation, our approach has the potential to be used as the foundation for more data-efficient low-resource language models."
}
Markdown (Informal)
[Story Generation with Large Language Models for African Languages](https://aclanthology.org/2025.africanlp-1.16/) (Essuman & Buys, AfricaNLP 2025)
ACL