Konkani ASR

Swapnil Fadte, Gaurish Thakkar, Jyoti D. Pawar


Abstract
Konkani is a resource-scarce language, mainly spoken on the west coast of India. The lack of resources directly impacts the development of language technology tools and services. Therefore, the development of digital resources is required to aid in the improvement of this situation. This paper describes the work on the Automatic Speech Recognition (ASR) System for Konkani language. We have created the ASR by fine-tuning the whisper-small ASR model with 100 hours of Konkani speech corpus data. The baseline model showed a word error rate (WER) of 17, which serves as evidence for the efficacy of the fine-tuning procedure in establishing ASR accuracy for Konkani language.
Anthology ID:
2023.icon-1.31
Volume:
Proceedings of the 20th International Conference on Natural Language Processing (ICON)
Month:
December
Year:
2023
Address:
Goa University, Goa, India
Editors:
Jyoti D. Pawar, Sobha Lalitha Devi
Venue:
ICON
SIG:
SIGLEX
Publisher:
NLP Association of India (NLPAI)
Note:
Pages:
393–397
Language:
URL:
https://preview.aclanthology.org/sigedu-bea-out-of-sync-correction/2023.icon-1.31/
DOI:
Bibkey:
Cite (ACL):
Swapnil Fadte, Gaurish Thakkar, and Jyoti D. Pawar. 2023. Konkani ASR. In Proceedings of the 20th International Conference on Natural Language Processing (ICON), pages 393–397, Goa University, Goa, India. NLP Association of India (NLPAI).
Cite (Informal):
Konkani ASR (Fadte et al., ICON 2023)
Copy Citation:
PDF:
https://preview.aclanthology.org/sigedu-bea-out-of-sync-correction/2023.icon-1.31.pdf