Luisa Geiger
2025
Evaluating Spatiotemporal Consistency in Automatically Generated Sewing Instructions
Luisa Geiger
|
Mareike Hartmann
|
Michael Sullivan
|
Alexander Koller
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
In this paper, we propose a novel, automatic tree-based evaluation metric for LLM-generated step-by-step assembly instructions, that more accurately reflects spatiotemporal aspects of construction than traditional metrics such as BLEU and BERT similarity scores. We apply our proposed metric to the domain of sewing instructions, and show that our metric better correlates with manually-annotated error counts, demonstrating our metric’s superiority for evaluating the spatiotemporal soundness of sewing instructions. Further experiments show that our metric is more robust than traditional approaches against artificially-constructed counterfactual examples that are specifically constructed to confound metrics that rely on textual similarity.
2021
SHAPELURN: An Interactive Language Learning Game with Logical Inference
Katharina Stein
|
Leonie Harter
|
Luisa Geiger
Proceedings of the First Workshop on Interactive Learning for Natural Language Processing
We investigate if a model can learn natural language with minimal linguistic input through interaction. Addressing this question, we design and implement an interactive language learning game that learns logical semantic representations compositionally. Our game allows us to explore the benefits of logical inference for natural language learning. Evaluation shows that the model can accurately narrow down potential logical representations for words over the course of the game, suggesting that our model is able to learn lexical mappings from scratch successfully.