Enriching children’s stories with LLMs: Delivering multilingual data enrichment for children’s books at scale and across markets

Zarah Weiss, Christof Meyer, Mikael Andersson


Abstract
This paper presents a user-centered, empirically guided approach to multilingual metadata enrichment for children’s books. We combine LLMs with human-in-the-loop quality control in a scalable CI/CD pipeline to curate brand collections that enhance book discovery and engagement for young readers across multiple European markets. Our results demonstrate that this hybrid approach delivers high-quality, child-appropriate labels, improves user experience, and accelerates deployment in real-world production environments. This work offers practical insights for applying generative NLP in the media and publishing industry.
Anthology ID:
2025.acl-industry.56
Volume:
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track)
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Georg Rehm, Yunyao Li
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
804–812
Language:
URL:
https://preview.aclanthology.org/mtsummit-25-ingestion/2025.acl-industry.56/
DOI:
10.18653/v1/2025.acl-industry.56
Bibkey:
Cite (ACL):
Zarah Weiss, Christof Meyer, and Mikael Andersson. 2025. Enriching children’s stories with LLMs: Delivering multilingual data enrichment for children’s books at scale and across markets. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track), pages 804–812, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Enriching children’s stories with LLMs: Delivering multilingual data enrichment for children’s books at scale and across markets (Weiss et al., ACL 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/mtsummit-25-ingestion/2025.acl-industry.56.pdf