Revisiting Age of Acquisition in Curriculum Learning: Disentangling Lexical Features and Semantic Structure

Ian Gifford, Aaron Shah, Catherine Chen, Taimaa Kassab Bachi, Eva Portelance


Abstract
Previous work has found that ordering training data by children’s Age of Acquisition (AoA) for words increases the stability of distributional word embeddings, suggesting that early-learned words play a privileged role in shaping semantic structure. In this study, we determine whether AoA itself drives these effects, or whether they emerge from correlated lexical factors such as frequency, concreteness, and phonological complexity. Using incremental Word2Vec training, we construct curricula ordered by AoA and by individual lexical features, while systematically controlling for vocabulary growth and deterministic ordering effects. We show that AoA-ordered curricula produce greater early-phase stability than shuffled baselines, even under controlled exposure conditions. We find that the advantage observed with AoA can be largely explained by correlated factors like overall word frequency. Despite limited gains on general similarity benchmarks, AoA-ordered embeddings outperform shuffled embeddings on a proxy domain-specific task: predicting human AoA norms. This advantage persists after debiasing timestamp effects, implying that AoA curricula induce developmentally meaningful semantic structure.
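As a rough illustration of what an AoA-ordered curriculum and a shuffled baseline might look like, here is a minimal sketch. It is not the authors' pipeline. The toy AoA values, the rule of scoring a sentence by its latest-acquired word, and the function names `sentence_aoa`, `aoa_curriculum`, and `shuffled_baseline` are all assumptions made for this example. It also omits the controls for vocabulary growth and deterministic ordering that the abstract mentions.

```python
import random

# Hypothetical AoA norms (age in years); illustrative values, not real data.
TOY_AOA = {"dog": 2.5, "ball": 2.8, "run": 3.1, "tax": 9.4, "justice": 10.2}


def sentence_aoa(sentence, aoa):
    """Score a sentence by its latest-acquired known word (an assumed rule)."""
    scores = [aoa[w] for w in sentence if w in aoa]
    return max(scores) if scores else None


def aoa_curriculum(sentences, aoa):
    """Sort sentences from early- to late-acquired; unscored sentences go last."""
    scored = [(sentence_aoa(s, aoa), i, s) for i, s in enumerate(sentences)]
    # Ties are broken by original position so the ordering is deterministic.
    known = sorted((t for t in scored if t[0] is not None),
                   key=lambda t: (t[0], t[1]))
    unknown = [t for t in scored if t[0] is None]
    return [s for _, _, s in known + unknown]


def shuffled_baseline(sentences, seed=0):
    """Return a seeded random permutation of the same sentences."""
    out = list(sentences)
    random.Random(seed).shuffle(out)
    return out


corpus = [["justice", "tax"], ["dog", "run"], ["ball"], ["dog", "tax"]]
ordered = aoa_curriculum(corpus, TOY_AOA)
baseline = shuffled_baseline(corpus)
```

In an incremental setup, each ordering would be split into consecutive chunks and fed to the embedding model chunk by chunk, so that the two conditions see the same data and differ only in presentation order.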
Anthology ID:
2026.conll-main.40
Volume:
Proceedings of the 30th Conference on Computational Natural Language Learning
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Claire Bonial, Yevgeni Berzak
Venues:
CoNLL | WS
Publisher:
Association for Computational Linguistics
Pages:
661–676
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.conll-main.40/
Cite (ACL):
Ian Gifford, Aaron Shah, Catherine Chen, Taimaa Kassab Bachi, and Eva Portelance. 2026. Revisiting Age of Acquisition in Curriculum Learning: Disentangling Lexical Features and Semantic Structure. In Proceedings of the 30th Conference on Computational Natural Language Learning, pages 661–676, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
Revisiting Age of Acquisition in Curriculum Learning: Disentangling Lexical Features and Semantic Structure (Gifford et al., CoNLL 2026)
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.conll-main.40.pdf