MUSIA: Multilingual Story Illustration Corpus for Cross-Cultural Alignment and Generation

Krishna Tewari, Supriya Chanda, Nirmit Patil, Sukomal Pal


Abstract
Recent advances in text-to-image generation have enabled automated visual storytelling, yet most existing datasets remain monolingual and culturally narrow. We introduce MUSIA, a Multilingual Story Illustration Corpus designed to advance research in cross-lingual and culturally grounded narrative illustration. MUSIA comprises bilingual (English-Hindi) story-image pairs drawn from open literary and folk sources, curated to reflect diverse cultural themes, artistic styles, and linguistic structures. Each story includes multiple illustrations aligned at the scene level, accompanied by quality-verified mappings for narrative-visual coherence. To establish a reproducible benchmark, we propose a two-stage baseline combining transformer-based semantic summarization with diffusion-based image generation, achieving strong performance in relevance, visual quality, and consistency. MUSIA represents the first step toward a scalable, culturally inclusive benchmark for multilingual visual storytelling, enabling fair and reproducible research across low-resource and underrepresented languages.
Anthology ID:
2026.lrec-main.150
Volume:
Proceedings of the Fifteenth Language Resources and Evaluation Conference
Month:
May
Year:
2026
Address:
Palma de Mallorca, Spain
Editors:
Stelios Piperidis, Núria Bel, Henk van den Heuvel, Nancy Ide, Simon Krek, Antonio Toral
Venue:
LREC
SIG:
Publisher:
ELRA Language Resource Association
Note:
Pages:
1914–1924
Language:
URL:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.150/
DOI:
Bibkey:
Cite (ACL):
Krishna Tewari, Supriya Chanda, Nirmit Patil, and Sukomal Pal. 2026. MUSIA: Multilingual Story Illustration Corpus for Cross-Cultural Alignment and Generation. International Conference on Language Resources and Evaluation, main:1914–1924.
Cite (Informal):
MUSIA: Multilingual Story Illustration Corpus for Cross-Cultural Alignment and Generation (Tewari et al., LREC 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.150.pdf