CroCoSyn: A Cross-Lingual and Cross-Model Corpus of LLM-Generated Film Synopses

Louis Escouflaire


Abstract
We introduce CroCoSyn, a controlled, cross-lingual and cross-model corpus of 25,920 LLM-generated film synopses in English and French. Each synopsis is generated under systematically varied conditions, including model type, temperature, genre, protagonist gender, and narrative constraints, and enriched with structured metadata capturing characters and their relationships. Comparing Mistral and Llama across different model temperature degrees, CroCoSyn enables fine-grained analysis of narrative content, style, and character representation across models and languages. The corpus supports research on gender and cultural biases and story generation evaluation, and provides a foundation for comparative studies between LLM-generated and human-written narratives.
Anthology ID:
2026.latechclfl-1.4
Volume:
Proceedings of the 10th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature 2026
Month:
March
Year:
2026
Address:
Rabat, Morocco
Editors:
Diego Alves, Yuri Bizzoni, Stefania Degaetano-Ortlieb, Anna Kazantseva, Janis Pagel, Stan Szpakowicz
Venues:
LaTeCH-CLfL | WS
SIG:
SIGHUM
Publisher:
Association for Computational Linguistics
Note:
Pages:
30–35
Language:
URL:
https://preview.aclanthology.org/ingest-eacl/2026.latechclfl-1.4/
DOI:
Bibkey:
Cite (ACL):
Louis Escouflaire. 2026. CroCoSyn: A Cross-Lingual and Cross-Model Corpus of LLM-Generated Film Synopses. In Proceedings of the 10th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature 2026, pages 30–35, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):
CroCoSyn: A Cross-Lingual and Cross-Model Corpus of LLM-Generated Film Synopses (Escouflaire, LaTeCH-CLfL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-eacl/2026.latechclfl-1.4.pdf
Supplementarymaterial:
 2026.latechclfl-1.4.SupplementaryMaterial.zip
Supplementarymaterial:
 2026.latechclfl-1.4.SupplementaryMaterial.txt