FACT5: A Novel Benchmark and Pipeline for Nuanced Fact-Checking of Complex Statements

Shayan Chowdhury, Sunny Fang, Smaranda Muresan


Abstract
Fact-checking complex statements is integral to combating misinformation, but manual approaches are time-consuming, while automated approaches often oversimplify truthfulness into binary classifications and rely on resource-intensive models. This paper introduces: (i) FACT5, a curated dataset of 150 real-world statements with five ordinal classes of truthfulness, designed to capture the nuanced nature of factual accuracy and (ii) an open-source end-to-end pipeline using large language models (LLMs) that decomposes statements into atomic claims, generates targeted questions, retrieves evidence from the web, and produces justified verdicts. We evaluate our pipeline on FACT5 using Mistral-7B-v0.3 and Google’s Gemini-1.5-Flash. Our findings demonstrate significant improvements over baseline LLM performance, with Mistral-7B showing a 71.9% reduction in MSE for pass@3 evaluation. The FACT5 dataset, pipeline implementation, and evaluation framework are anonymized and provided at https://github.com/shayantist/FACT5/, and a demo of the pipeline can be interacted with at https://fact5check.streamlit.app/.
Anthology ID:
2025.fever-1.8
Volume:
Proceedings of the Eighth Fact Extraction and VERification Workshop (FEVER)
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Mubashara Akhtar, Rami Aly, Christos Christodoulopoulos, Oana Cocarascu, Zhijiang Guo, Arpit Mittal, Michael Schlichtkrull, James Thorne, Andreas Vlachos
Venues:
FEVER | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
101–117
Language:
URL:
https://preview.aclanthology.org/display_plenaries/2025.fever-1.8/
DOI:
Bibkey:
Cite (ACL):
Shayan Chowdhury, Sunny Fang, and Smaranda Muresan. 2025. FACT5: A Novel Benchmark and Pipeline for Nuanced Fact-Checking of Complex Statements. In Proceedings of the Eighth Fact Extraction and VERification Workshop (FEVER), pages 101–117, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
FACT5: A Novel Benchmark and Pipeline for Nuanced Fact-Checking of Complex Statements (Chowdhury et al., FEVER 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/display_plenaries/2025.fever-1.8.pdf