Hyesoo Hong
2026
DUSK: Do Not Unlearn Shared Knowledge
Wonje Jeung | Sangyeon Yoon | Hyesoo Hong | Soeun Kim | Seungju Han | Youngjae Yu | Albert No
Findings of the Association for Computational Linguistics: ACL 2026
Wonje Jeung | Sangyeon Yoon | Hyesoo Hong | Soeun Kim | Seungju Han | Youngjae Yu | Albert No
Findings of the Association for Computational Linguistics: ACL 2026
Machine unlearning aims to remove “forget” data while preserving knowledge from the “retain” data, yet a fundamental question arises when the two share content. By definition, an unlearned model should be indistinguishable from a model retrained solely on the retain set, which implies that shared knowledge must remain while only forget-specific content is removed. To evaluate this requirement, we introduce DUSK, the first benchmark for unlearning under realistic knowledge overlap. DUSK constructs documents containing both shared and unique knowledge and defines seven metrics to test whether methods erase forget-specific expressions without discarding shared facts. Evaluating nine recent approaches, we find that although surface text is often removed, current methods struggle to distinguish shared from unique knowledge, either erasing information that should be retained or failing to fully forget target content. DUSK provides a controlled, reproducible testbed for diagnosing these failures and guiding precise unlearning algorithms.