An LLM Investigation into Inherent and Structural Case Representation: a German Case Study

Iona Carslaw, András Bárány, Itamar Kastner, Mark Steedman


Abstract
A question for computational linguistics has been to what degree do language models encode case information. However, the majority of the work has focused on structural cases (cases which change when the syntactic configuration changes). On the other hand, inherent cases (which are assigned by specific lexical items and do not change if the syntactic configuration changes) have been overlooked. This paper sets out to investigate if German language models distinctly encode inherent dative from structural accusative and nominative. We conducted a linguistic probing investigation where probes are trained on contextual word embeddings of active nominative, accusative, and dative arguments to predict if passivised datives are analysed as a structural nominative. We provide a cased and caseless version of the experiment. Our results suggest that when case information is removed language models can distinguish between inherent dative and structural accusative, regardless of argument position, due to verb information. However, language models cannot distinguish between structural nominative and inherent dative when the dative appears in a position where there is an expected nominative, due to over-relying on surface patterns.
Anthology ID:
2026.scil-main.8
Volume:
Proceedings of the Society for Computation in Linguistics 2026
Month:
July
Year:
2026
Address:
San Diego, CA
Editors:
Rob Voigt, Alex Warstadt, Naomi Feldman, Tal Linzen
Venues:
SCiL | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
72–89
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.scil-main.8/
DOI:
Bibkey:
Cite (ACL):
Iona Carslaw, András Bárány, Itamar Kastner, and Mark Steedman. 2026. An LLM Investigation into Inherent and Structural Case Representation: a German Case Study. In Proceedings of the Society for Computation in Linguistics 2026, pages 72–89, San Diego, CA. Association for Computational Linguistics.
Cite (Informal):
An LLM Investigation into Inherent and Structural Case Representation: a German Case Study (Carslaw et al., SCiL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.scil-main.8.pdf