When retrieval outperforms generation: Dense evidence retrieval for scalable fake news detection

Alamgir Munir Qazi; John Philip McCrae; Jamal A. Nasir

When retrieval outperforms generation: Dense evidence retrieval for scalable fake news detection

Alamgir Munir Qazi, John Philip McCrae, Jamal Nasir

Abstract

9 The proliferation of misinformation necessitates robust yet computationally efficient fact verification systems. While current state-of-the-art approaches leverage Large Language Models (LLMs) for generating explanatory rationales, these methods face significant computational barriers and hallucination risks in real-world deployments. We present DeReC (Dense Retrieval Classification), a lightweight framework that demonstrates how general-purpose text embeddings can effectively replace autoregressive LLM-based approaches in fact verification tasks. By combining dense retrieval with specialized classification, our system achieves better accuracy while being significantly more efficient. DeReC outperforms explanation-generating LLMs in efficiency, reducing runtime by 95% on RAWFC (23 minutes 36 seconds compared to 454 minutes 12 seconds) and by 92% on LIAR-RAW (134 minutes 14 seconds compared to 1692 minutes 23 seconds), showcasing its effectiveness across varying dataset sizes. On the RAWFC dataset, DeReC achieves an F1 score of 65.58%, surpassing the state-of-the-art method L-Defense (61.20%). Our results demonstrate that carefully engineered retrieval-based systems can match or exceed LLM performance in specialized tasks while being significantly more practical for real-world deployment.

Anthology ID:: 2025.ldk-1.26
Volume:: Proceedings of the 5th Conference on Language, Data and Knowledge
Month:: September
Year:: 2025
Address:: Naples, Italy
Editors:: Mehwish Alam, Andon Tchechmedjiev, Jorge Gracia, Dagmar Gromann, Maria Pia di Buono, Johanna Monti, Maxim Ionov
Venues:: LDK | WS
SIG:
Publisher:: Unior Press
Note:
Pages:: 255–265
Language:
URL:: https://preview.aclanthology.org/ldl-25-ingestion/2025.ldk-1.26/
DOI:
Bibkey:
Cite (ACL):: Alamgir Munir Qazi, John Philip McCrae, and Jamal Nasir. 2025. When retrieval outperforms generation: Dense evidence retrieval for scalable fake news detection. In Proceedings of the 5th Conference on Language, Data and Knowledge, pages 255–265, Naples, Italy. Unior Press.
Cite (Informal):: When retrieval outperforms generation: Dense evidence retrieval for scalable fake news detection (Qazi et al., LDK 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/ldl-25-ingestion/2025.ldk-1.26.pdf

PDF Cite Search Fix data