Anatomy of Unlearning: The Dual Impact of Fact Salience and Model Fine-Tuning

Anna Borisiuk, Andrey Savchenko, Alexander Panchenko, Elena Tutubalina


Abstract
Machine Unlearning (MU) enables Large Language Models (LLMs) to remove unsafe or outdated information. However, existing work assumes that all facts are equally forgettable and largely ignores whether the forgotten knowledge originates from pretraining or supervised fine-tuning (SFT). In this paper, we introduce DUAL (Dual Unlearning Evaluation across Training Stages), a benchmark of 28.6k Wikidata-derived triplets annotated with fact popularity using Wikipedia link counts and LLM-based salience scores. Our experiments show that pretrained and SFT models respond differently to unlearning. An SFT step on the forget data yields smoother forgetting, more stable tuning, and 10–50% higher retention, while direct unlearning on pretrained models remains unstable and prone to relearning or catastrophic forgetting.
Anthology ID:
2026.findings-acl.1287
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
25846–25859
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1287/
DOI:
Bibkey:
Cite (ACL):
Anna Borisiuk, Andrey Savchenko, Alexander Panchenko, and Elena Tutubalina. 2026. Anatomy of Unlearning: The Dual Impact of Fact Salience and Model Fine-Tuning. In Findings of the Association for Computational Linguistics: ACL 2026, pages 25846–25859, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Anatomy of Unlearning: The Dual Impact of Fact Salience and Model Fine-Tuning (Borisiuk et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1287.pdf
Checklist:
 2026.findings-acl.1287.checklist.pdf