@article{slack-al-moubayed-2025-early,
title = "Early Detection and Reduction of Memorization for Domain Adaptation and Instruction Tuning",
author = "Slack, Dean L. and
Al Moubayed, Noura",
journal = "Transactions of the Association for Computational Linguistics",
volume = "13",
year = "2025",
address = "Cambridge, MA",
publisher = "MIT Press",
    url = "https://aclanthology.org/2025.tacl-1.66/",
doi = "10.1162/tacl.a.49",
pages = "1459--1473",
abstract = "Although large language models excel across many tasks, they can memorize training data and thereby expose private or copyrighted text. Most defenses target the pre-training stage, leaving memorization during fine-tuning{--}especially for domain adaptation and instruction tuning{--}poorly understood. We fine-tune Pythia, Llama3, and Mistral models spanning 1.4B{--}70B parameters on common evaluation datasets and track verbatim memorization throughout training. We find that memorization increases dramatically in the first few epochs, often significantly before either validation perplexity or evaluation performance is optimized. We use a simple but effective n-gram memorization score which reliably precedes verbatim memorization; using it as an early-stopping criterion mitigates memorization with minimal performance loss. Further, we introduce an n-gram{--}aware loss regularizer and show that it reduces memorization across all model families tested by up to 40{\%} while minimizing evaluation performance trade-offs when compared to an existing memorization mitigation strategy. These results yield practical, scalable insights into memorization dynamics during language model fine-tuning."
}

Markdown (Informal)
[Early Detection and Reduction of Memorization for Domain Adaptation and Instruction Tuning](https://aclanthology.org/2025.tacl-1.66/) (Slack & Al Moubayed, TACL 2025)
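The abstract refers to a "simple but effective n-gram memorization score" without defining it. Below is a minimal, illustrative sketch of one plausible formulation: the fraction of n-grams in a model-generated continuation that also appear in the training example it was prompted from. The function names, the default n = 4, and the scoring definition are assumptions for illustration only, not the authors' method.

```python
# Hypothetical sketch of an n-gram overlap memorization score (not taken from the paper).
from collections import Counter
from typing import List


def ngrams(tokens: List[str], n: int) -> Counter:
    """Return the multiset of all n-grams in a token sequence."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def ngram_memorization_score(generated: List[str], reference: List[str], n: int = 4) -> float:
    """Fraction of n-grams in the generated continuation that also occur in the
    reference training text: 0.0 means no overlap, 1.0 means every generated
    n-gram is found in the training example."""
    gen_ngrams = ngrams(generated, n)
    ref_ngrams = ngrams(reference, n)
    if not gen_ngrams:
        return 0.0
    overlap = sum(min(count, ref_ngrams[gram]) for gram, count in gen_ngrams.items())
    return overlap / sum(gen_ngrams.values())


if __name__ == "__main__":
    reference = "the quick brown fox jumps over the lazy dog".split()
    generated = "the quick brown fox jumps over a sleeping cat".split()
    # Tracking a score like this over training epochs is one way such a signal
    # could serve as an early-stopping criterion, as the abstract describes.
    print(f"4-gram memorization score: {ngram_memorization_score(generated, reference):.2f}")
```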