Making Synthetic Questions More Child-Directed: Prompting and Sampling Effects

Whitney Poh, Michael Tombolini, Libby Barak


Abstract
Child-directed Speech (CDS) has been shown to better support language learning as training data for computational models. Artificially generated input aims at replicating the advantage of CDS by re-creating targeted linguistic properties. Recently, the use of questions in CDS has been suggested as a linguistic property that may entail an effective discourse structure for model training. However, previous work has shown inconsistent improvement over baseline using questions in training data. In this study, we propose a new question generation method that aligns both the generation prompts and sampling methods with properties of CDS. We show that prompt wording substantially changes whether synthetic questions match CDS on surface properties such as MLU and question type. Despite marked improvements over baseline, enhanced CDS-likeness does not translate into consistent downstream gains. Overall, our results show that the role of questions in training data is a topic worth looking further into.
Anthology ID:
2026.cdl-1.18
Volume:
Proceedings of the 1st Workshop on Computational Developmental Linguistics (CDL)
Month:
July
Year:
2026
Address:
Grand Hyatt Manchester San Diego, 1 Market Pl, San Diego, CA 92101
Editors:
Martin Ziqiao Ma, Emmy Liu, Jing Liu, Tyler A. Chang, Abdellah Fourtassi, Alex Warstadt, Michael Hahn, Weiwei Sun, Freda Shi
Venues:
CDL | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
129–135
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.cdl-1.18/
DOI:
Bibkey:
Cite (ACL):
Whitney Poh, Michael Tombolini, and Libby Barak. 2026. Making Synthetic Questions More Child-Directed: Prompting and Sampling Effects. In Proceedings of the 1st Workshop on Computational Developmental Linguistics (CDL), pages 129–135, Grand Hyatt Manchester San Diego, 1 Market Pl, San Diego, CA 92101. Association for Computational Linguistics.
Cite (Informal):
Making Synthetic Questions More Child-Directed: Prompting and Sampling Effects (Poh et al., CDL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.cdl-1.18.pdf