That is a Known Lie: Detecting Previously Fact-Checked Claims
Shaden Shaar, Nikolay Babulkov, Giovanni Da San Martino, Preslav Nakov
Abstract
The recent proliferation of ”fake news” has triggered a number of responses, most notably the emergence of several manual fact-checking initiatives. As a result and over time, a large number of fact-checked claims have been accumulated, which increases the likelihood that a new claim in social media or a new statement by a politician might have already been fact-checked by some trusted fact-checking organization, as viral claims often come back after a while in social media, and politicians like to repeat their favorite statements, true or false, over and over again. As manual fact-checking is very time-consuming (and fully automatic fact-checking has credibility issues), it is important to try to save this effort and to avoid wasting time on claims that have already been fact-checked. Interestingly, despite the importance of the task, it has been largely ignored by the research community so far. Here, we aim to bridge this gap. In particular, we formulate the task and we discuss how it relates to, but also differs from, previous work. We further create a specialized dataset, which we release to the research community. Finally, we present learning-to-rank experiments that demonstrate sizable improvements over state-of-the-art retrieval and textual similarity approaches.- Anthology ID:
- 2020.acl-main.332
- Volume:
- Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
- Month:
- July
- Year:
- 2020
- Address:
- Online
- Editors:
- Dan Jurafsky, Joyce Chai, Natalie Schluter, Joel Tetreault
- Venue:
- ACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 3607–3618
- Language:
- URL:
- https://aclanthology.org/2020.acl-main.332
- DOI:
- 10.18653/v1/2020.acl-main.332
- Cite (ACL):
- Shaden Shaar, Nikolay Babulkov, Giovanni Da San Martino, and Preslav Nakov. 2020. That is a Known Lie: Detecting Previously Fact-Checked Claims. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 3607–3618, Online. Association for Computational Linguistics.
- Cite (Informal):
- That is a Known Lie: Detecting Previously Fact-Checked Claims (Shaar et al., ACL 2020)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-3/2020.acl-main.332.pdf
- Code
- sshaar/That-is-a-Known-Lie + additional community code
- Data
- GLUE