Swiss German Speech Translation and the Curse of Multidialectality

Martin Bär, Andrea DeMarco, Gorka Labaka


Abstract
In many languages, non-standardized varieties make the development of NLP models challenging. This paper explores various fine-tuning techniques and data setups for training Swiss German to Standard German speech-to-text translation models. While fine-tuning on all available Swiss German data yields the best results, ASR pre-training lowers performance by 1.48 BLEU points, and jointly training on Swiss and Standard German data reduces it by 2.29 BLEU. Our dialect transfer experiments suggest that an equivalent of the Curse of Multilinguality (Conneau et al., 2020) exists in dialectal speech processing, as training on multiple dialects jointly tends to decrease single-dialect performance. However, introducing small amounts of dialectal variability can improve the performance for low-resource dialects.
Anthology ID:
2025.iwslt-1.15
Volume:
Proceedings of the 22nd International Conference on Spoken Language Translation (IWSLT 2025)
Month:
July
Year:
2025
Address:
Vienna, Austria (in-person and online)
Editors:
Elizabeth Salesky, Marcello Federico, Antonis Anastasopoulos
Venues:
IWSLT | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
165–179
Language:
URL:
https://preview.aclanthology.org/landing_page/2025.iwslt-1.15/
DOI:
Bibkey:
Cite (ACL):
Martin Bär, Andrea DeMarco, and Gorka Labaka. 2025. Swiss German Speech Translation and the Curse of Multidialectality. In Proceedings of the 22nd International Conference on Spoken Language Translation (IWSLT 2025), pages 165–179, Vienna, Austria (in-person and online). Association for Computational Linguistics.
Cite (Informal):
Swiss German Speech Translation and the Curse of Multidialectality (Bär et al., IWSLT 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/landing_page/2025.iwslt-1.15.pdf