Prosody Labelled Dataset for Hindi

Esha Banerjee, Atul Kr. Ojha, Girish Jha


Abstract
This study aims to develop an intonation labelled database for Hindi, for enhancing prosody in ASR and TTS systems, which is also helpful for building Speech to Speech Machine Translation systems. Although no single standard for prosody labelling exists in Hindi, researchers in the past have employed perceptual and statistical methods in literature to draw inferences about the behaviour of prosody patterns in Hindi. Based on such existing research and largely agreed upon intonational theories in Hindi, this study attempts to develop a manually annotated prosodic corpus of Hindi speech data, which can be used for training speech models for natural-sounding speech in the future. 500 sentences (2,550 words) for declarative and interrogative types have been labelled using Praat.
Anthology ID:
2021.smp-1.2
Volume:
Proceedings of the Workshop on Speech and Music Processing 2021
Month:
December
Year:
2021
Address:
NIT Silchar, India
Editors:
Anupam Biswas, Rabul Hussain Laskar, Pinki Roy
Venue:
SMP
SIG:
Publisher:
NLP Association of India (NLPAI)
Note:
Pages:
14–19
Language:
URL:
https://aclanthology.org/2021.smp-1.2
DOI:
Bibkey:
Cite (ACL):
Esha Banerjee, Atul Kr. Ojha, and Girish Jha. 2021. Prosody Labelled Dataset for Hindi. In Proceedings of the Workshop on Speech and Music Processing 2021, pages 14–19, NIT Silchar, India. NLP Association of India (NLPAI).
Cite (Informal):
Prosody Labelled Dataset for Hindi (Banerjee et al., SMP 2021)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-bitext-workshop/2021.smp-1.2.pdf