A Shared Geometry of Difficulty in Multilingual Language Models

Stefano Civelli, Pietro Bernardelle, Nicolò Brunello, Gianluca Demartini


Abstract
Large language models (LLMs) encode problem difficulty as an internal signal that can be linearly decoded from their residuals. Given their multilingual capabilities, we investigate whether this meta-cognitive signal is language-agnostic and how it is organized across the model’s layers by training linear probes on the AMC subset of the Easy2Hard benchmark, translated into 21 languages. We found that difficulty-related signals emerge at two distinct stages of the model internals, corresponding to shallow (early-layers) and deep (later-layers) internal representations, that exhibit functionally different behaviors. Probes trained on deep representations achieve high accuracy when evaluated on the same language but exhibit weaker cross-lingual transfer. In contrast, probes trained on shallow representations generalize better across languages, despite achieving lower within-language performance. This closely aligns with existing findings in LLM interpretability, showing that models tend to operate in an abstract conceptual space before producing language-specific outputs. Our results suggest that this two-stage organizational principle extends beyond simple semantic processing to meta-cognitive properties such as problem difficulty, highlighting an internal control signal that is not tied to surface meaning.
Anthology ID:
2026.acl-short.66
Volume:
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
796–807
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.acl-short.66/
DOI:
Bibkey:
Cite (ACL):
Stefano Civelli, Pietro Bernardelle, Nicolò Brunello, and Gianluca Demartini. 2026. A Shared Geometry of Difficulty in Multilingual Language Models. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 796–807, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
A Shared Geometry of Difficulty in Multilingual Language Models (Civelli et al., ACL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.acl-short.66.pdf
Checklist:
 2026.acl-short.66.checklist.pdf