Probing Neural Network Generalization using Default Patterns

Brandon Prickett, Tianyi Nyu, Katya Pertsova


Abstract
Whether neural-network models can learn minority-default patterns has been a matter of some controversy. Results based on modeling real human language data are hard to interpret due to their complexity. Therefore, we examine the learning of a simple artificial-language pattern involving defaults using three computational models: an Encoder-Decoder RNN, a Transformer Encoder, and a Logistic Regression classifier. Overall, we find that the models have the hardest time with minority defaults, but can eventually learn them and apply them to novel words (although they do not always extend them to completely novel segments or novel CV-sequences). Type frequency has the largest effect on learning in all models, trumping the effect of distribution. We examine the weights of two models to provide further insight into how defaults are represented inside the models.
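
The paper itself does not include code on this page; as a rough illustration of the kind of setup the abstract describes, the minimal sketch below trains a logistic-regression classifier to assign artificial-language stems to one of two inflection classes and then probes generalization to novel stems, including one with an unseen segment. The stems, features, and class labels are hypothetical and chosen only for illustration, not taken from the paper's stimuli.

```python
# Hypothetical sketch of a logistic-regression baseline for a minority-default task.
# All data below are invented for illustration.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression

# Toy training data: CV-shaped stems and the inflection class they take.
stems = ["bako", "tipu", "guna", "domi", "pela", "siku", "rova", "lemi"]
classes = [0, 0, 0, 0, 0, 0, 1, 1]  # class 1 stands in for a low-frequency "default"

# Represent each stem as character uni-/bigram counts, a crude stand-in for
# the segmental features a real model would use.
vectorizer = CountVectorizer(analyzer="char", ngram_range=(1, 2))
X = vectorizer.fit_transform(stems)

model = LogisticRegression(max_iter=1000)
model.fit(X, classes)

# Generalization probe: classify novel stems, including one containing a
# segment ("z") never seen in training.
novel = ["fatu", "zoki"]
print(model.predict(vectorizer.transform(novel)))

# Weight inspection (analogous in spirit to the paper's analysis of two models):
# large-magnitude coefficients show which segments cue each class.
for feat, w in sorted(zip(vectorizer.get_feature_names_out(), model.coef_[0]),
                      key=lambda t: -abs(t[1]))[:5]:
    print(f"{feat!r}: {w:+.2f}")
```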
Anthology ID:
2025.sigmorphon-main.4
Volume:
Proceedings of the The 22nd SIGMORPHON workshop on Computational Morphology, Phonology, and Phonetics
Month:
May
Year:
2025
Address:
Albuquerque, New Mexico, USA
Editors:
Garrett Nicolai, Eleanor Chodroff, Frederic Mailhot, Çağrı Çöltekin
Venues:
SIGMORPHON | WS
SIG:
SIGMORPHON
Publisher:
Association for Computational Linguistics
Pages:
34–44
URL:
https://preview.aclanthology.org/fix-sig-urls/2025.sigmorphon-main.4/
Cite (ACL):
Brandon Prickett, Tianyi Nyu, and Katya Pertsova. 2025. Probing Neural Network Generalization using Default Patterns. In Proceedings of the The 22nd SIGMORPHON workshop on Computational Morphology, Phonology, and Phonetics, pages 34–44, Albuquerque, New Mexico, USA. Association for Computational Linguistics.
Cite (Informal):
Probing Neural Network Generalization using Default Patterns (Prickett et al., SIGMORPHON 2025)
PDF:
https://preview.aclanthology.org/fix-sig-urls/2025.sigmorphon-main.4.pdf