Modelling the Diachronic Emergence of Phoneme Frequency Distributions

Fermin Moscoso Del Prado Martin, Suchir Salhan


Abstract
Phoneme frequency distributions exhibit robust statistical regularities across languages, including exponential-tailed rank-frequency patterns and a negative relationship between phonemic inventory size and the relative entropy of the distribution. The origin of these patterns remains largely unexplained. In this paper, we investigate whether they can arise as consequences of the historical processes that shape phonological systems. We introduce a stochastic model of phonological change and simulate the diachronic evolution of phoneme inventories. A naïve version of the model reproduces the general shape of phoneme rank-frequency distributions but fails to capture other empirical properties. Extending the model with two additional assumptions –an effect related to frequency and a stabilising tendency toward a preferred inventory size– yields simulations that match both the observed distributions and the negative relationship between inventory size and relative entropy. These results suggest that some statistical regularities of phonological systems may arise as a result of diachronic sound change instead of –or in addition to– explicit optimisation or compensatory mechanisms.
Anthology ID:
2026.scil-main.14
Volume:
Proceedings of the Society for Computation in Linguistics 2026
Month:
July
Year:
2026
Address:
San Diego, CA
Editors:
Rob Voigt, Alex Warstadt, Naomi Feldman, Tal Linzen
Venues:
SCiL | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
138–146
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.scil-main.14/
DOI:
Bibkey:
Cite (ACL):
Fermin Moscoso Del Prado Martin and Suchir Salhan. 2026. Modelling the Diachronic Emergence of Phoneme Frequency Distributions. In Proceedings of the Society for Computation in Linguistics 2026, pages 138–146, San Diego, CA. Association for Computational Linguistics.
Cite (Informal):
Modelling the Diachronic Emergence of Phoneme Frequency Distributions (Moscoso Del Prado Martin & Salhan, SCiL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.scil-main.14.pdf