How much capacity does Turkish inflection require? An empirical study of GRU encoder–decoder bottlenecks.

Fred Mailhot


Abstract
Encoder–decoder neural networks with high-dimensional (e.g. d=300-–500) embeddings and hidden layers can be used to model a variety of morphophonological phenomena as sequence-to-sequence mappings, achieving high accuracy across languages and patterns. We show here that these high-capacity models are overparameterized, at least for the task of morphological inflection, and that simpler and smaller networks can perform near ceiling on the task of inflecting Turkish stems. Moreover these reduced-capacity models encode linguistically relevant information even when they are too small to succeed at the inflectional task.
Anthology ID:
2026.scil-main.39
Volume:
Proceedings of the Society for Computation in Linguistics 2026
Month:
July
Year:
2026
Address:
San Diego, CA
Editors:
Rob Voigt, Alex Warstadt, Naomi Feldman, Tal Linzen
Venues:
SCiL | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
423–426
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.scil-main.39/
DOI:
Bibkey:
Cite (ACL):
Fred Mailhot. 2026. How much capacity does Turkish inflection require? An empirical study of GRU encoder–decoder bottlenecks.. In Proceedings of the Society for Computation in Linguistics 2026, pages 423–426, San Diego, CA. Association for Computational Linguistics.
Cite (Informal):
How much capacity does Turkish inflection require? An empirical study of GRU encoder–decoder bottlenecks. (Mailhot, SCiL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.scil-main.39.pdf