Soeun Kim
2026
DUSK: Do Not Unlearn Shared Knowledge
Wonje Jeung | Sangyeon Yoon | Hyesoo Hong | Soeun Kim | Seungju Han | Youngjae Yu | Albert No
Findings of the Association for Computational Linguistics: ACL 2026
Wonje Jeung | Sangyeon Yoon | Hyesoo Hong | Soeun Kim | Seungju Han | Youngjae Yu | Albert No
Findings of the Association for Computational Linguistics: ACL 2026
Machine unlearning aims to remove “forget” data while preserving knowledge from the “retain” data, yet a fundamental question arises when the two share content. By definition, an unlearned model should be indistinguishable from a model retrained solely on the retain set, which implies that shared knowledge must remain while only forget-specific content is removed. To evaluate this requirement, we introduce DUSK, the first benchmark for unlearning under realistic knowledge overlap. DUSK constructs documents containing both shared and unique knowledge and defines seven metrics to test whether methods erase forget-specific expressions without discarding shared facts. Evaluating nine recent approaches, we find that although surface text is often removed, current methods struggle to distinguish shared from unique knowledge, either erasing information that should be retained or failing to fully forget target content. DUSK provides a controlled, reproducible testbed for diagnosing these failures and guiding precise unlearning algorithms.
2025
Assigning Distinct Roles to Quantized and Low-Rank Matrices Toward Optimal Weight Decomposition
Yoonjun Cho | Soeun Kim | Dongjae Jeon | Kyelim Lee | Beomsoo Lee | Albert No
Findings of the Association for Computational Linguistics: ACL 2025
Yoonjun Cho | Soeun Kim | Dongjae Jeon | Kyelim Lee | Beomsoo Lee | Albert No
Findings of the Association for Computational Linguistics: ACL 2025
Decomposing weight matrices into quantization and low-rank components ( W≈ Q+LR) is a widely used technique for compressing large language models (LLMs). Existing joint optimization methods iteratively alternate between quantization and low-rank approximation. However, these methods tend to prioritize one component at the expense of the other, resulting in suboptimal decompositions that fail to leverage each component’s unique strengths. In this work, we introduce Outlier-Driven Low-Rank Initialization (ODLRI), which assigns low-rank components the specific role of capturing activation-sensitive weights. This structured decomposition mitigates outliers’ negative impact on quantization, enabling more effective balance between quantization and low-rank approximation. Experiments on Llama2 (7B, 13B, 70B), Llama3-8B, and Mistral-7B demonstrate that incorporating ODLRI into the joint optimization framework consistently reduces activation-aware error, minimizes quantization scale, and improves perplexity and zero-shot accuracy in low-bit settings.