Utilizing POS-Driven Pitch Contour Analysis for Enhanced Tamil Text-to-Speech Synthesis

Preethi Thinakaran, Anushiya Rachel Gladston, P Vijayalakshmi, T Nagarajan, Malarvizhi Muthuramalingam, Sooriya S


Abstract
A novel approach to text-to-speech synthesis that integrates pitch contour labels derived from the highest occurrence analysis for each Part-of-Speech (POS) tag. Using the Stanford POS Tagger, grammatical tags are assigned to words, and the most frequently occurring pitch contour labels associated with these tags are analyzed, focusing on both unigram and bigram statistics. The primary goal is to identify the pitch contour for each POS tag based on its frequency of occurrence. These pitch contour labels are incorporated into the output of the synthesized waveform using the TD-PSOLA (Time Domain Pitch Synchronous Overlap and Add) signal processing algorithm. The resulting waveform is evaluated using Mean Opinion Scores (MOS), demonstrating significant enhancements in quality and producing a prosodically rich synthetic speech.
Anthology ID:
2024.icon-1.82
Volume:
Proceedings of the 21st International Conference on Natural Language Processing (ICON)
Month:
December
Year:
2024
Address:
AU-KBC Research Centre, Chennai, India
Editors:
Sobha Lalitha Devi, Karunesh Arora
Venue:
ICON
SIG:
Publisher:
NLP Association of India (NLPAI)
Note:
Pages:
269–273
Language:
URL:
https://preview.aclanthology.org/icon-24-ingestion/2024.icon-1.82/
DOI:
Bibkey:
Cite (ACL):
Preethi Thinakaran, Anushiya Rachel Gladston, P Vijayalakshmi, T Nagarajan, Malarvizhi Muthuramalingam, and Sooriya S. 2024. Utilizing POS-Driven Pitch Contour Analysis for Enhanced Tamil Text-to-Speech Synthesis. In Proceedings of the 21st International Conference on Natural Language Processing (ICON), pages 269–273, AU-KBC Research Centre, Chennai, India. NLP Association of India (NLPAI).
Cite (Informal):
Utilizing POS-Driven Pitch Contour Analysis for Enhanced Tamil Text-to-Speech Synthesis (Thinakaran et al., ICON 2024)
Copy Citation:
PDF:
https://preview.aclanthology.org/icon-24-ingestion/2024.icon-1.82.pdf