Aimée Lahaussois
2026
What Do Neural Speech Models Know About Phonology? Evidence from Structured Phoneme Confusions
Eli Stafford | Aimée Lahaussois | Guillaume Wisniewski
Findings of the Association for Computational Linguistics: ACL 2026
Eli Stafford | Aimée Lahaussois | Guillaume Wisniewski
Findings of the Association for Computational Linguistics: ACL 2026
ASR errors are typically analysed at the phoneme level, treating phonemes as atomic symbols. In this work, we instead adopt a featural representation of phonemes, grounded in phonological theory, which models speech sounds as structured bundles of distinctive articulatory and acoustic properties. This perspective allows us to analyse recognition errors at a finer granularity and to investigate whether certain phonological features are more vulnerable than others. Across multiple languages, we show that phoneme confusions are strongly structured in phonological feature space: errors are predominantly local and exhibit systematic asymmetries that reveal a small set of weakly modelled features. These findings have direct implications both for the design and diagnosis of ASR systems and for cognitive models of human speech perception, where similar feature-level asymmetries have long been observed.