How Grounded is Wikipedia? A Study on Structured Evidential Support and Retrieval

William Gantt Walden, Kathryn Ricci, Miriam Wanner, Zhengping Jiang, Chandler May, Rongkun Zhou, Benjamin Van Durme


Abstract
Wikipedia is a critical resource for modern NLP, serving as a rich repository of up-to-date and citation-backed information on a wide variety of subjects. The reliability of Wikipedia—its groundedness in its cited sources—is vital to this purpose. This work analyzes both how grounded Wikipedia is and how readily fine-grained grounding evidence can be retrieved. To this end, we introduce PeopleProfiles—a large-scale, multi-level dataset of claim support annotations on biographical Wikipedia articles. We show that: ~22% of claims in Wikipedia *lead* sections are unsupported by the article body; ~30% of annotated claims in the article *body* are unsupported by their (publicly accessible) sources; and real-world Wikipedia citation practices often differ from documented standards. Finally, we show that complex evidence retrieval remains a challenge—even for recent reasoning rerankers.
Anthology ID:
2026.findings-acl.19
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
400–420
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.19/
DOI:
Bibkey:
Cite (ACL):
William Gantt Walden, Kathryn Ricci, Miriam Wanner, Zhengping Jiang, Chandler May, Rongkun Zhou, and Benjamin Van Durme. 2026. How Grounded is Wikipedia? A Study on Structured Evidential Support and Retrieval. In Findings of the Association for Computational Linguistics: ACL 2026, pages 400–420, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
How Grounded is Wikipedia? A Study on Structured Evidential Support and Retrieval (Walden et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.19.pdf
Checklist:
 2026.findings-acl.19.checklist.pdf