Reason-KE++: Aligning the Process, Not Just the Outcome, for Faithful LLM Knowledge Editing

Yuchen Wu; Liang Ding; Li Shen; Dacheng Tao

Reason-KE++: Aligning the Process, Not Just the Outcome, for Faithful LLM Knowledge Editing

Yuchen Wu, Liang Ding, Li Shen, Dacheng Tao

Abstract

Aligning Large Language Models (LLMs) to be faithful to new knowledge in complex, multi-hop reasoning tasks is a critical, yet unsolved, challenge. We find that SFT-based methods, e.g., Reason-KE, while state-of-the-art, suffer from a "faithfulness gap": they optimize for format mimicry rather than sound reasoning. This gap enables the LLM’s powerful parametric priors to override new contextual facts, resulting in critical factual hallucinations (e.g., incorrectly reasoning "Houston" from "NASA" despite an explicit edit). To solve this core LLM alignment problem, we propose **Reason-KE++**, an SFT+RL framework that instills process-level faithfulness. Its core is a Stage-aware Reward mechanism that provides dense supervision for intermediate reasoning steps (e.g., Decomposition, Sub-answer Correctness). Crucially, we identify that naive outcome-only RL is a deceptive trap for LLM alignment: it collapses reasoning integrity (e.g., 19.00% Hop acc) while superficially boosting final accuracy. Our process-aware framework sets **a new SOTA of 95.48%** on MQUAKE-CF-3k (+5.28%), demonstrating that for complex tasks, aligning the reasoning process is essential for building trustworthy LLMs.

Anthology ID:: 2026.findings-acl.1137
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 22639–22655
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1137/
DOI:
Bibkey:
Cite (ACL):: Yuchen Wu, Liang Ding, Li Shen, and Dacheng Tao. 2026. Reason-KE++: Aligning the Process, Not Just the Outcome, for Faithful LLM Knowledge Editing. In Findings of the Association for Computational Linguistics: ACL 2026, pages 22639–22655, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Reason-KE++: Aligning the Process, Not Just the Outcome, for Faithful LLM Knowledge Editing (Wu et al., Findings 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1137.pdf
Checklist:: 2026.findings-acl.1137.checklist.pdf

PDF Cite Search Checklist Fix data