Decoding-Unlearning: Fact Forgetting via Entropy-Guided Inference

Jingwen Pu; Mingjun Shi; Xinrui Ren; Yizhe Wang; Xinyu Zhang; Zhaokun Wang; Kun She

Abstract

Large Language Models (LLMs) exhibit powerful capabilities but inevitably memorize sensitive information, raising privacy, copyright, and safety concerns. Existing LLM unlearning methods typically rely on updating model parameters. While effective, they are often limited in real-world scenarios: fine-tuning large-scale models is costly, may introduce irreversible risks, and depends on both forget and retain datasets, which are often difficult to obtain in full. To address these challenges, an ideal solution is to achieve unlearning at inference time. To this end, we propose SEGUE, a training-free, plug-and-play inference-time unlearning strategy. SEGUE employs a probe to detect queries involving forgettable concepts and applies entropy-guided decoding to suppress target knowledge, enabling controllable non-factual generation while preserving overall model capabilities. Experiments on the MUSE, RWKU, and WMDP datasets, covering copyright, entity, and potential-risk knowledge, show that SEGUE effectively balances sensitive knowledge suppression and generation quality, outperforming most existing inference-time unlearning methods.

Anthology ID: 2026.acl-long.1850
Volume: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month: July
Year: 2026
Address: San Diego, California, United States
Editors: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue: ACL
Publisher: Association for Computational Linguistics
Pages: 39834–39860
URL: https://preview.aclanthology.org/ingest-acl/2026.acl-long.1850/
Cite (ACL): Jingwen Pu, Mingjun Shi, Xinrui Ren, Yizhe Wang, Xinyu Zhang, Zhaokun Wang, and Kun She. 2026. Decoding-Unlearning: Fact Forgetting via Entropy-Guided Inference. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 39834–39860, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal): Decoding-Unlearning: Fact Forgetting via Entropy-Guided Inference (Pu et al., ACL 2026)
PDF: https://preview.aclanthology.org/ingest-acl/2026.acl-long.1850.pdf
Checklist: 2026.acl-long.1850.checklist.pdf
