Retrieval-Augmented Generation Meets Local Languages for Improved Drug Information Access and Comprehension.
Ahmad Ibrahim Ismail, Bashirudeen Opeyemi Ibrahim, Olubayo Adekanmbi, Ife Adebara
Abstract
Medication errors are among the leading causes of avoidable harm in healthcare systems across the world. A large portion of these errors stem from inefficient information retrieval processes and lack of comprehension of drug information. In low-resource settings, these issues are exacerbated by limited access to updated and reliable sources, technological constraints, and linguistic barriers. Innovations to improve the retrieval and comprehension of drug-related information are therefore poised to reduce medication errors and improve patient outcomes. This research employed open-source Retrieval-Augmented Generation (RAG) integrated with multilingual translation and Text-to-Speech (TTS) systems. Using open-source tools, a corpus was created from prominent sources of medical information in Nigeria and stored as high-level text embeddings in a Chroma database. Upon user query, relevant drug information is retrieved and synthesized using a large language model. This can be translated into Yoruba, Igbo, and Hausa languages, and converted into speech through the TTS system, addressing the linguistic accessibility gap. Evaluation of the system by domain experts indicated impressive overall performance in translation, achieving an average accuracy of 73%, and the best performance observed in Hausa and Yoruba. TTS results were moderately effective (mean = 57%), with Igbo scoring highest in speech clarity (68%). However, tonal complexity, especially in Yoruba, posed challenges for accurate pronunciation, highlighting the need for language-specific model fine-tuning. Addressing these linguistic nuances is essential to optimize comprehension and practical utility in diverse healthcare settings. The results demonstrates systems the potential to improve access to drug information, enhance comprehension, and reduce linguistic barriers. These technologies could substantially mitigate medication errors and improve patient safety. This study offers valuable insights and practical guidelines for future implementations aimed at strengthening global medication safety practices.- Anthology ID:
- 2025.africanlp-1.15
- Volume:
- Proceedings of the Sixth Workshop on African Natural Language Processing (AfricaNLP 2025)
- Month:
- July
- Year:
- 2025
- Address:
- Vienna, Austria
- Editors:
- Constantine Lignos, Idris Abdulmumin, David Adelani
- Venues:
- AfricaNLP | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 108–114
- Language:
- URL:
- https://preview.aclanthology.org/acl25-workshop-ingestion/2025.africanlp-1.15/
- DOI:
- Cite (ACL):
- Ahmad Ibrahim Ismail, Bashirudeen Opeyemi Ibrahim, Olubayo Adekanmbi, and Ife Adebara. 2025. Retrieval-Augmented Generation Meets Local Languages for Improved Drug Information Access and Comprehension.. In Proceedings of the Sixth Workshop on African Natural Language Processing (AfricaNLP 2025), pages 108–114, Vienna, Austria. Association for Computational Linguistics.
- Cite (Informal):
- Retrieval-Augmented Generation Meets Local Languages for Improved Drug Information Access and Comprehension. (Ismail et al., AfricaNLP 2025)
- PDF:
- https://preview.aclanthology.org/acl25-workshop-ingestion/2025.africanlp-1.15.pdf