Memorisation Meets Compositionality in Natural Language Processing

Verna Dankers

Memorisation Meets Compositionality in Natural Language Processing

Abstract

Memorisation in deep learning is undergoing a paradigm shift; it is increasingly recognised as a mechanism that can support, rather than hinder, generalisation. This is particularly relevant in NLP, where language combines compositional, generalisable structure with non-compositional expressions such as idioms, requiring memorisation from models and humans alike. My PhD work investigated memorisation in transformer models in generic terms, and through the lens of (non-)compositionality, from both data and model-internal perspectives. I analysed which training examples require memorisation, whether memorisation supports generalisation, and where memorisation occurs within model layers. I also studied how transformers process non-compositional idiom translations and how they balance compositional generalisation with non-compositional memorisation. Based on my findings, I stress that memorisation is an inherent part of learning natural language, can be beneficial, and is partially predictable. Yet it is not cleanly separable from generalisation, both at the level of data and of model parameters. Here, I summarise those findings and reflect on my PhD work.

Anthology ID:: 2026.bigpicture-main.12
Volume:: Proceedings of The Big Picture v2: Crafting a Research Narrative
Month:: July
Year:: 2026
Address:: San Diego, CA, USA
Editors:: Yanai Elazar, Allyson Ettinger, Nora Kassner, Sebastian Ruder
Venues:: BigPicture | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 144–159
Language:
URL:: https://preview.aclanthology.org/ingest-acl-workshops/2026.bigpicture-main.12/
DOI:
Bibkey:
Cite (ACL):: Verna Dankers. 2026. Memorisation Meets Compositionality in Natural Language Processing. In Proceedings of The Big Picture v2: Crafting a Research Narrative, pages 144–159, San Diego, CA, USA. Association for Computational Linguistics.
Cite (Informal):: Memorisation Meets Compositionality in Natural Language Processing (Dankers, BigPicture 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl-workshops/2026.bigpicture-main.12.pdf

PDF Cite Search Fix data