Central Kurdish Text-to-Speech and Its Application in Speech-to-Text Translation

Mohammad Mohammadamini, Meysam Shamsi, Marie Tahon


Abstract
In this study, we show how from available resources develop high-quality TTS models for low-resource scenarios that according to our extensive evaluation surpass the models trained on dedicated TTS data recorded in the studio. We develop three Text-to-Speech (TTS) models for Central Kurdish as a low-resource language using F5-TTS architecture. The models are trained on Central Kurdish TTS datasets in which two of them are curated from audiobooks during this study and the third one is evaluated for the first time. We also demonstrate the potential of TTS models for developing other speech technologies in low-resource languages by proposing a speech synthesis framework used in a speech-to-text translation application, achieving promising results on standard speech translation benchmarks. The curated TTS resources and models will be publicly available under CC BY-NC-ND 4.0 license
Anthology ID:
2026.lrec-main.48
Volume:
Proceedings of the Fifteenth Language Resources and Evaluation Conference
Month:
May
Year:
2026
Address:
Palma de Mallorca, Spain
Editors:
Stelios Piperidis, Núria Bel, Henk van den Heuvel, Nancy Ide, Simon Krek, Antonio Toral
Venue:
LREC
SIG:
Publisher:
ELRA Language Resource Association
Note:
Pages:
664–673
Language:
URL:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.48/
DOI:
Bibkey:
Cite (ACL):
Mohammad Mohammadamini, Meysam Shamsi, and Marie Tahon. 2026. Central Kurdish Text-to-Speech and Its Application in Speech-to-Text Translation. International Conference on Language Resources and Evaluation, main:664–673.
Cite (Informal):
Central Kurdish Text-to-Speech and Its Application in Speech-to-Text Translation (Mohammadamini et al., LREC 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.48.pdf