ASR TAMIL SSN@ LT-EDI-2024: Automatic Speech Recognition system for Elderly People

Suhasini S, Bharathi B


Abstract
This paper discusses our results in the Shared Task on Speech Recognition for Vulnerable Individuals in Tamil (LT-EDI-2024). The goal of the task is to build an automatic speech recognition (ASR) system for Tamil. The task dataset was collected from elderly Tamil speakers. The proposed ASR system is built on the pre-trained model akashsivanandan/wav2vec2-large-xls-r300m-tamil-colab-final, which was fine-tuned on the Tamil common speech dataset. The test data released by the task organizers is passed to the system, and the resulting transcriptions are submitted to the task. Submissions are evaluated by Word Error Rate (WER). Our proposed system attained a WER of 29.297%.
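For readers unfamiliar with the metric, WER is conventionally defined as (S + D + I) / N, where S, D, and I are the word substitutions, deletions, and insertions needed to turn the reference transcript into the system output, and N is the number of words in the reference. The sketch below is a minimal illustration of that definition, not the shared task's scoring script: the function name `word_error_rate` is hypothetical, and whitespace tokenization with no text normalization is an assumption, since the abstract does not say how transcripts were normalized before scoring.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Assumption: words are separated by whitespace and compared exactly;
    the task's own scoring may normalize text differently.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution and one deletion against a four-word reference.
    print(word_error_rate("a b c d", "a x c"))
```

A reported WER of 29.297% corresponds to a return value of about 0.293 under this definition when the edit counts are pooled over the whole test set; whether the task pools counts or averages per utterance is not stated in the abstract.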
Anthology ID:
2024.ltedi-1.40
Volume:
Proceedings of the Fourth Workshop on Language Technology for Equality, Diversity, Inclusion
Month:
March
Year:
2024
Address:
St. Julian's, Malta
Editors:
Bharathi Raja Chakravarthi, Bharathi B, Paul Buitelaar, Thenmozhi Durairaj, György Kovács, Miguel Ángel García Cumbreras
Venues:
LTEDI | WS
Publisher:
Association for Computational Linguistics
Pages:
294–298
URL:
https://aclanthology.org/2024.ltedi-1.40
Cite (ACL):
Suhasini S and Bharathi B. 2024. ASR TAMIL SSN@ LT-EDI-2024: Automatic Speech Recognition system for Elderly People. In Proceedings of the Fourth Workshop on Language Technology for Equality, Diversity, Inclusion, pages 294–298, St. Julian's, Malta. Association for Computational Linguistics.
Cite (Informal):
ASR TAMIL SSN@ LT-EDI-2024: Automatic Speech Recognition system for Elderly People (S & B, LTEDI-WS 2024)
PDF:
https://preview.aclanthology.org/landing_page/2024.ltedi-1.40.pdf
Video:
https://preview.aclanthology.org/landing_page/2024.ltedi-1.40.mp4