Child-directed speech facilitates production, not comprehension, in BabyLMs

Bastian Bunzeck, Sina Zarrieß


Abstract
Recent studies suggest that child-directed speech is not conducive to language learning in BabyLMs. However, current evaluations focus predominantly on comprehension and not production, which is central to usage-based theories of language acquisition which argue how CDS facilitates early language use through constructional ”frames” (frequent lexical patterns with open slots). We introduce a novel generation-based evaluation inspired by such theories in form of a **frame-completion task**, and compare Llama models trained with CDS, the BabyLM corpus, and web-crawl data (FineWeb-edu) on comprehension benchmarks and our novel framework. Our results reveal a clear dissociation between models’ comprehension and production capabilities: while FineWeb-trained models excel at minimal pairs, CDS-trained models produce grammatical completions substantially earlier in training and concentrate probability mass on appropriate slot-fillers. These findings show that comprehension benchmarks underestimate what CDS affords to BabyLMs.
Anthology ID:
2026.conll-main.14
Volume:
Proceedings of the 30th Conference on Computational Natural Language Learning
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Claire Bonial, Yevgeni Berzak
Venues:
CoNLL | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
227–249
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.conll-main.14/
DOI:
Bibkey:
Cite (ACL):
Bastian Bunzeck and Sina Zarrieß. 2026. Child-directed speech facilitates production, not comprehension, in BabyLMs. In Proceedings of the 30th Conference on Computational Natural Language Learning, pages 227–249, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
Child-directed speech facilitates production, not comprehension, in BabyLMs (Bunzeck & Zarrieß, CoNLL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.conll-main.14.pdf