REFLEX: Self-Refining Explainable Fact-Checking via Verdict-Anchored Style Control

Chuyi Kong, Wei Gao, Jing Ma, Hongzhan Lin, Yuxi Sun


Abstract
The prevalence of fake news on social media calls for automated fact-checking systems that deliver not only accurate verdicts but also faithful explanations. However, existing large language model (LLM)-based methods often overlook deceptive misinformation styles in generated explanations, producing unfaithful rationales that may mislead human judgment. They also rely heavily on external knowledge sources, which can introduce hallucinations and incur substantial latency, undermining both reliability and responsiveness in real-time settings. To address these limitations, we propose REason-guided Fact-checking with Latent EXplanations (REFLEX), a self-refining framework that explicitly controls reasoning style by anchoring explanations to the predicted verdict. REFLEX leverages self-disagreement veracity signals between a backbone model and its fine-tuned variant to construct steering vectors, thereby naturally disentangling factual content from stylistic cues. Experiments on a real-world benchmark show that REFLEX achieves state-of-the-art performance under LLaMA-series models using only 465 self-refined samples. Owing to its transferability, REFLEX also yields gains of up to 7.54 Macro-F1 points on in-the-wild data. Further analysis shows that our method effectively mitigates faithful hallucination, leading to both more reliable explanations and more accurate verdicts than prior explainable fact-checking approaches.
Anthology ID:
2026.acl-long.202
Volume:
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
4403–4431
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.acl-long.202/
DOI:
Bibkey:
Cite (ACL):
Chuyi Kong, Wei Gao, Jing Ma, Hongzhan Lin, and Yuxi Sun. 2026. REFLEX: Self-Refining Explainable Fact-Checking via Verdict-Anchored Style Control. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 4403–4431, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
REFLEX: Self-Refining Explainable Fact-Checking via Verdict-Anchored Style Control (Kong et al., ACL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.acl-long.202.pdf
Checklist:
 2026.acl-long.202.checklist.pdf