Forget What Matters, Keep the Rest: Selective Unlearning of Informative Tokens

Seunghee Koh, Sunghyun Baek, Youngdong Kim, Junmo Kim


Abstract
Unlearning in large language models (LLMs) has emerged as a promising safeguard against adversarial behaviors. When the forgetting loss is applied uniformly without considering token-level semantic importance, model utility can be unnecessarily degraded. Recent studies have explored token-wise loss regularizers that prioritize informative tokens, but largely rely on ground-truth confidence or external linguistic parsers, which limits their ability to capture contextual information or the model’s overall predictive state. Intuitively, function words like “the” primarily serve syntactic roles and are highly predictable with little ambiguity, but informative words admit multiple plausible alternatives with greater uncertainty. Based on this intuition, we propose Entropy-guided Token Weighting (ETW), a token-level unlearning regularizer that uses entropy of the predictive distribution as a proxy for token informativeness. We demonstrate that informative tokens tend to have higher entropy, whereas structural tokens tend to have lower entropy. This behavior enables ETW to achieve more effective unlearning while better preserving model utility than existing token-level approaches.
Anthology ID:
2026.acl-long.1175
Volume:
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
25639–25655
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.acl-long.1175/
DOI:
Bibkey:
Cite (ACL):
Seunghee Koh, Sunghyun Baek, Youngdong Kim, and Junmo Kim. 2026. Forget What Matters, Keep the Rest: Selective Unlearning of Informative Tokens. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 25639–25655, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Forget What Matters, Keep the Rest: Selective Unlearning of Informative Tokens (Koh et al., ACL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.acl-long.1175.pdf
Checklist:
 2026.acl-long.1175.checklist.pdf