Max Kreminski
2025
LLMs Behind the Scenes: Enabling Narrative Scene Illustration
Melissa Roemmele
|
John Joon Young Chung
|
Taewook Kim
|
Yuqian Sun
|
Alex Calderwood
|
Max Kreminski
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Generative AI has established the opportunity to readily transform content from one medium to another. This capability is especially powerful for storytelling, where visual illustrations can illuminate a story originally expressed in text. In this paper, we focus on the task of narrative scene illustration, which involves automatically generating an image depicting a scene in a story. Motivated by recent progress on text-to-image models, we consider a pipeline that uses LLMs as an interface for prompting text-to-image models to generate scene illustrations given raw story text. We apply variations of this pipeline to a prominent story corpus in order to synthesize illustrations for scenes in these stories. We conduct a human annotation task to obtain pairwise quality judgments for these illustrations. The outcome of this process is the SceneIllustrations dataset, which we release as a new resource for future work on cross-modal narrative transformation. Through our analysis of this dataset and experiments modeling illustration quality, we demonstrate that LLMs can effectively verbalize scene knowledge implicitly evoked by story text. Moreover, this capability is impactful for generating and evaluating illustrations.
2022
Unmet Creativity Support Needs in Computationally Supported Creative Writing
Max Kreminski
|
Chris Martens
Proceedings of the First Workshop on Intelligent and Interactive Writing Assistants (In2Writing 2022)
Large language models (LLMs) enabled by the datasets and computing power of the last decade have recently gained popularity for their capacity to generate plausible natural language text from human-provided prompts. This ability makes them appealing to fiction writers as prospective co-creative agents, addressing the common challenge of writer’s block, or getting unstuck. However, creative writers face additional challenges, including maintaining narrative consistency, developing plot structure, architecting reader experience, and refining their expressive intent, which are not well-addressed by current LLM-backed tools. In this paper, we define these needs by grounding them in cognitive and theoretical literature, then survey previous computational narrative research that holds promise for supporting each of them in a co-creative setting.
Search
Fix author
Co-authors
- Alex Calderwood 1
- John Joon Young Chung 1
- Taewook Kim 1
- Chris Martens 1
- Melissa Roemmele 1
- show all...