PictureStories: Predicting the Task Adherence of Language Learner Answers to a Picture Story-Based Writing Task
Marie Bexte, Andrew Caines, Diane Nicholls, Paula Buttery, Torsten Zesch
Abstract
We investigate the automated evaluation of English language learner answers to writing tasks featuring picture stories.This is usually limited to language proficiency only, neglecting the context of the picture. Instead, our analysis focuses on task adherence, which for example allows detection of off-topic answers.Since there is a lack of suitable training and evaluation data, our first step is to build the PictureStories dataset.To this end, we develop a marking rubric that covers task adherence with respect to both form and content. Six annotators mark 713 learner answers written in response to one of five picture stories.Having assembled the dataset, we then explore to what extent task adherence can be predicted automatically. Our experiments assume a scenario where no or just a few labelled answers are available for the picture story which is being marked.For form-focused criteria, we find that it is beneficial to finetune models across tasks.With content-focused criteria, few-shot prompting Qwen emerges as the best-performing method. We examine the trade-off between including the story image vs. example answers in the prompt and find that examples suffice in many cases. While for some LLMs, few-shot prompting results may look promising on the surface, we demonstrate that a much simpler method can do just as well when shown the same examples.- Anthology ID:
- 2026.eacl-long.108
- Volume:
- Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)
- Month:
- March
- Year:
- 2026
- Address:
- Rabat, Morocco
- Editors:
- Vera Demberg, Kentaro Inui, Lluís Marquez
- Venue:
- EACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 2398–2415
- Language:
- URL:
- https://preview.aclanthology.org/ingest-eacl/2026.eacl-long.108/
- DOI:
- Cite (ACL):
- Marie Bexte, Andrew Caines, Diane Nicholls, Paula Buttery, and Torsten Zesch. 2026. PictureStories: Predicting the Task Adherence of Language Learner Answers to a Picture Story-Based Writing Task. In Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pages 2398–2415, Rabat, Morocco. Association for Computational Linguistics.
- Cite (Informal):
- PictureStories: Predicting the Task Adherence of Language Learner Answers to a Picture Story-Based Writing Task (Bexte et al., EACL 2026)
- PDF:
- https://preview.aclanthology.org/ingest-eacl/2026.eacl-long.108.pdf