How Grounded is Wikipedia? A Study on Structured Evidential Support and Retrieval
William Gantt Walden, Kathryn Ricci, Miriam Wanner, Zhengping Jiang, Chandler May, Rongkun Zhou, Benjamin Van Durme
Abstract
Wikipedia is a critical resource for modern NLP, serving as a rich repository of up-to-date and citation-backed information on a wide variety of subjects. The reliability of Wikipedia—its groundedness in its cited sources—is vital to this purpose. This work analyzes both how grounded Wikipedia is and how readily fine-grained grounding evidence can be retrieved. To this end, we introduce PeopleProfiles—a large-scale, multi-level dataset of claim support annotations on biographical Wikipedia articles. We show that: ~22% of claims in Wikipedia *lead* sections are unsupported by the article body; ~30% of annotated claims in the article *body* are unsupported by their (publicly accessible) sources; and real-world Wikipedia citation practices often differ from documented standards. Finally, we show that complex evidence retrieval remains a challenge—even for recent reasoning rerankers.- Anthology ID:
- 2026.findings-acl.19
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2026
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 400–420
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.19/
- DOI:
- Cite (ACL):
- William Gantt Walden, Kathryn Ricci, Miriam Wanner, Zhengping Jiang, Chandler May, Rongkun Zhou, and Benjamin Van Durme. 2026. How Grounded is Wikipedia? A Study on Structured Evidential Support and Retrieval. In Findings of the Association for Computational Linguistics: ACL 2026, pages 400–420, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- How Grounded is Wikipedia? A Study on Structured Evidential Support and Retrieval (Walden et al., Findings 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.19.pdf