Speech-to-Speech Machine Translation for Dialectal Variations of Hindi

Sanmay Sood, Siddharth Rajput, Md Shad Akhtar


Abstract
Hindi has many dialects and they are vital to India’s cultural and linguistics heritage. However, many of them have been largely overlooked in modern language technological advancements, primarily due to lack proper resources. In this study, we explore speech-to-speech machine translation (S2ST) for four Hindi dialects, i.e., Awadhi, Bhojpuri, Braj Bhasha, and Magahi. We adopt a cascaded S2ST pipeline comprising of three stages: Automatic Speech Recognition (ASR), Machine Translation (MT), and Text-to-Speech (TTS). We evaluate many recent large language models (LLMs) for dialect-to-Hindi and dialect-to-English translations in zero-shot, few-shot, and chain-of-thought setups. Our comparative analysis offers insights into the current capabilities and limitations of LLM-based approaches for low-resource dialectal S2ST in Indian context.
Anthology ID:
2025.wat-1.5
Volume:
Proceedings of the Twelfth Workshop on Asian Translation (WAT 2025)
Month:
December
Year:
2025
Address:
Mumbai, India
Editors:
Toshiaki Nakazawa, Isao Goto
Venues:
WAT | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
54–65
Language:
URL:
https://preview.aclanthology.org/ingest-ijcnlp-aacl/2025.wat-1.5/
DOI:
Bibkey:
Cite (ACL):
Sanmay Sood, Siddharth Rajput, and Md Shad Akhtar. 2025. Speech-to-Speech Machine Translation for Dialectal Variations of Hindi. In Proceedings of the Twelfth Workshop on Asian Translation (WAT 2025), pages 54–65, Mumbai, India. Association for Computational Linguistics.
Cite (Informal):
Speech-to-Speech Machine Translation for Dialectal Variations of Hindi (Sood et al., WAT 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-ijcnlp-aacl/2025.wat-1.5.pdf