Björn Rudzewitz

Also published as: Bjoern Rudzewitz

2026

Using LLMs for item creation: Validating the potential of automatically generated sentence repetition test items for language assessment
Sarah Löber | Björn Rudzewitz | Yuan Chu | Mengyuan He | Shiqin Liu | Yushan Ye | Xiaobin Chen
Proceedings of the 21st Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2026)

Various aspects of the Elicited Imitation Test (EIT), a sentence repetition task for language assessment, can be automated, for example in terms of test administration or automatic scoring. It is potentially also possible to generate test items with Large Language Models (LLMs). This study investigates the potential of GPT-4o for item creation in the context of EIT, creating a parallel form to two popular and validated tests. We analysed the tests in terms of their linguistic and psychometric properties. While the items created by the LLM show some difference in grammatical structures when compared to human-written items, linguistic complexity results did not differ significantly between tests. Psychometric properties showed only minor differences. These findings lend support to the potential of Automatic Item Generation with LLMs in the context of sentence repetition tasks and might support the process of standardisation in SLA research and testing by enabling parallel test creation.

2024

pdf bib

Developing a Pedagogically Oriented Interactive Reading Tool with Teachers in the Loops
Mihwa Lee | Björn Rudzewitz | Xiaobin Chen
Proceedings of the 13th Workshop on Natural Language Processing for Computer Assisted Language Learning

pdf bib

Developing a Web-Based Intelligent Language Assessment Platform Powered by Natural Language Processing Technologies
Sarah Löber | Björn Rudzewitz | Daniela Verratti Souto | Luisa Ribeiro-Flucht | Xiaobin Chen
Proceedings of the 13th Workshop on Natural Language Processing for Computer Assisted Language Learning

2021

pdf bib

Automatic annotation of curricular language targets to enrich activity models and support both pedagogy and adaptive systems
Martí Quixal | Björn Rudzewitz | Elizabeth Bear | Detmar Meurers
Proceedings of the 10th Workshop on NLP for Computer Assisted Language Learning

2019

pdf bib

The Impact of Spelling Correction and Task Context on Short Answer Assessment for Intelligent Tutoring Systems
Ramon Ziai | Florian Nuxoll | Kordula De Kuthy | Björn Rudzewitz | Detmar Meurers
Proceedings of the 8th Workshop on NLP for Computer Assisted Language Learning

2018

pdf bib

Feedback Strategies for Form and Meaning in a Real-life Language Tutoring System
Ramon Ziai | Bjoern Rudzewitz | Kordula De Kuthy | Florian Nuxoll | Detmar Meurers
Proceedings of the 7th workshop on NLP for Computer Assisted Language Learning

pdf bib abs

Generating Feedback for English Foreign Language Exercises
Björn Rudzewitz | Ramon Ziai | Kordula De Kuthy | Verena Möller | Florian Nuxoll | Detmar Meurers
Proceedings of the Thirteenth Workshop on Innovative Use of NLP for Building Educational Applications

While immediate feedback on learner language is often discussed in the Second Language Acquisition literature (e.g., Mackey 2006), few systems used in real-life educational settings provide helpful, metalinguistic feedback to learners. In this paper, we present a novel approach leveraging task information to generate the expected range of well-formed and ill-formed variability in learner answers along with the required diagnosis and feedback. We combine this offline generation approach with an online component that matches the actual student answers against the pre-computed hypotheses. The results obtained for a set of 33 thousand answers of 7th grade German high school students learning English show that the approach successfully covers frequent answer patterns. At the same time, paraphrases and content errors require a more flexible alignment approach, for which we are planning to complement the method with the CoMiC approach successfully used for the analysis of reading comprehension answers (Meurers et al., 2011).