REACT: Representation Extraction And Controllable Tuning to Overcome Overfitting in LLM Knowledge Editing
Haitian Zhong, Yuhuan Liu, Ziyang Xu, Guofan Liu, Qiang Liu, Shu Wu, Zhe Zhao, Liang Wang, Tieniu Tan
Abstract
Large language model editing methods frequently suffer from overfitting, wherein factual updates can propagate beyond their intended scope, overemphasizing the edited target even when it is contextually inappropriate. To address this challenge, we introduce REACT (Representation Extraction And Controllable Tuning), a unified two-phase framework designed for precise and controllable knowledge editing. In the first phase, we use tailored stimuli to extract latent factual representations and apply Principal Component Analysis with a simple learnable linear transformation to compute a directional “belief shift” vector for each instance. In the second phase, we apply controllable perturbations to hidden states using the obtained vector scaled by a magnitude scalar, gated by a pre-trained classifier that permits edits only when contextually necessary. Experiments on EVOKE benchmarks demonstrate that REACT significantly reduces overfitting across nearly all evaluation metrics, and experiments on COUNTERFACT and MQuAKE show that our method preserves balanced basic editing performance (reliability, locality, and generality) under diverse editing scenarios.
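The two phases described in the abstract can be illustrated with a short sketch. The following is a minimal, hypothetical PyTorch illustration under assumptions of our own, not the authors' released implementation: the names (`BeliefShiftEditor`, `apply_edit`), the mean-pooling of PCA scores, and the 0.5 gating threshold are illustrative choices not specified in the abstract.

```python
# Minimal sketch of REACT's two phases as summarized in the abstract.
# All identifiers and design details below are assumptions for illustration.
import torch
import torch.nn as nn


class BeliefShiftEditor(nn.Module):
    """Phase 1 (sketch): distill a directional 'belief shift' vector from
    latent factual representations via PCA plus a learnable linear map."""

    def __init__(self, hidden_dim: int, n_components: int):
        super().__init__()
        # Learnable linear transformation applied after the PCA projection.
        self.linear = nn.Linear(n_components, hidden_dim)

    def belief_shift(self, reps: torch.Tensor) -> torch.Tensor:
        # reps: (n_stimuli, hidden_dim) hidden states elicited by
        # tailored stimuli for one edit instance.
        centered = reps - reps.mean(dim=0, keepdim=True)
        # PCA via SVD; rows of vh are principal directions.
        _, _, vh = torch.linalg.svd(centered, full_matrices=False)
        components = vh[: self.linear.in_features]      # (k, hidden_dim)
        scores = centered @ components.T                # (n_stimuli, k)
        # Pool PCA scores and map them to a shift direction (assumption:
        # mean pooling; the paper may pool differently).
        return self.linear(scores.mean(dim=0))          # (hidden_dim,)


def apply_edit(hidden: torch.Tensor, shift: torch.Tensor,
               gate_prob: float, alpha: float,
               threshold: float = 0.5) -> torch.Tensor:
    """Phase 2 (sketch): perturb hidden states by the scaled shift vector,
    but only when the pre-trained gate classifier deems an edit necessary."""
    if gate_prob < threshold:
        return hidden                 # edit not contextually necessary
    return hidden + alpha * shift     # alpha is the magnitude scalar
```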
- Anthology ID: 2025.emnlp-main.860
- Volume: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
- Month: November
- Year: 2025
- Address: Suzhou, China
- Editors: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
- Venue: EMNLP
- Publisher: Association for Computational Linguistics
- Pages: 16994–17011
- URL: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.860/
- Cite (ACL): Haitian Zhong, Yuhuan Liu, Ziyang Xu, Guofan Liu, Qiang Liu, Shu Wu, Zhe Zhao, Liang Wang, and Tieniu Tan. 2025. REACT: Representation Extraction And Controllable Tuning to Overcome Overfitting in LLM Knowledge Editing. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 16994–17011, Suzhou, China. Association for Computational Linguistics.
- Cite (Informal): REACT: Representation Extraction And Controllable Tuning to Overcome Overfitting in LLM Knowledge Editing (Zhong et al., EMNLP 2025)
- PDF: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.860.pdf