Tone in Perspective: A Computational Typological Analysis of Tone Function in ASR

Siyu Liang, Gina-Anne Levow


Abstract
This study investigates the impact of pitch flattening on automatic speech recognition (ASR) performance across tonal and non-tonal languages. Using vocoder-based signal processing techniques, we created pitch-flattened versions of speech recordings and compared ASR performance against original recordings. Results reveal that tonal languages experience substantially larger performance degradation than non-tonal languages. Analysis of tone confusion matrices shows systematic patterns of misidentification where contour tones collapse toward level tones when pitch information is removed. Calculation of tone’s functional load at syllable and word levels demonstrates that syllable-level functional load strongly predicts ASR vulnerability to pitch flattening, while word-level patterns reflect each language’s morphological structure. These findings illuminate the differential importance of pitch information across languages and suggest that ASR systems for languages with high syllable-level functional load require more robust pitch modeling.
Anthology ID:
2025.sigtyp-1.11
Volume:
Proceedings of the 7th Workshop on Research in Computational Linguistic Typology and Multilingual NLP
Month:
August
Year:
2025
Address:
Vinenna. Austria
Editors:
Michael Hahn, Priya Rani, Ritesh Kumar, Andreas Shcherbakov, Alexey Sorokin, Oleg Serikov, Ryan Cotterell, Ekaterina Vylomova
Venues:
SIGTYP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
82–92
Language:
URL:
https://preview.aclanthology.org/landing_page/2025.sigtyp-1.11/
DOI:
Bibkey:
Cite (ACL):
Siyu Liang and Gina-Anne Levow. 2025. Tone in Perspective: A Computational Typological Analysis of Tone Function in ASR. In Proceedings of the 7th Workshop on Research in Computational Linguistic Typology and Multilingual NLP, pages 82–92, Vinenna. Austria. Association for Computational Linguistics.
Cite (Informal):
Tone in Perspective: A Computational Typological Analysis of Tone Function in ASR (Liang & Levow, SIGTYP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/landing_page/2025.sigtyp-1.11.pdf