Is a cute puyfred cute? Context-dependent form-meaning systematicity in LLMs

Jaïr A. Waal, Giovanni Cassani


Abstract
We investigate static and contextualized embeddings for English pseudowords across a variety of Large Language Models (LLMs), to study (i) how these models represent semantic attributes of strings they encounter for the very first time and how (ii) these representations interact with sentence context. We zoom in on a key semantic attribute, valence, which plays an important role in theories of language processing, acquisition, and evolution. Across three experiments, we show that pseudoword valence is encoded in meaningful ways both in isolation and in context, and that, in some LLMs, pseudowords affect the representation of whole sentences similarly to words. This highlights how, at least for most LLMs we surveyed, pseudowords and words are not qualitatively different constructs. Our study confirms that LLMs capture systematic mappings between form and valence, and shows how different LLMs handle the contextualisation of pseudowords differently. Our findings provide a first computational exploration of how sub-lexical distributional patterns influence the valence of novel strings in context, offering useful insights for theories on the form-meaning interface and how it affects language learning and processing.
Anthology ID:
2025.findings-acl.961
Volume:
Findings of the Association for Computational Linguistics: ACL 2025
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
18747–18769
Language:
URL:
https://preview.aclanthology.org/landing_page/2025.findings-acl.961/
DOI:
Bibkey:
Cite (ACL):
Jaïr A. Waal and Giovanni Cassani. 2025. Is a cute puyfred cute? Context-dependent form-meaning systematicity in LLMs. In Findings of the Association for Computational Linguistics: ACL 2025, pages 18747–18769, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Is a cute puyfred cute? Context-dependent form-meaning systematicity in LLMs (Waal & Cassani, Findings 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/landing_page/2025.findings-acl.961.pdf