That is a Known Lie: Detecting Previously Fact-Checked Claims

Shaden Shaar; Nikolay Babulkov; Giovanni Da San Martino; Preslav Nakov

doi:10.18653/v1/2020.acl-main.332

That is a Known Lie: Detecting Previously Fact-Checked Claims

Shaden Shaar, Nikolay Babulkov, Giovanni Da San Martino, Preslav Nakov

Abstract

The recent proliferation of ”fake news” has triggered a number of responses, most notably the emergence of several manual fact-checking initiatives. As a result and over time, a large number of fact-checked claims have been accumulated, which increases the likelihood that a new claim in social media or a new statement by a politician might have already been fact-checked by some trusted fact-checking organization, as viral claims often come back after a while in social media, and politicians like to repeat their favorite statements, true or false, over and over again. As manual fact-checking is very time-consuming (and fully automatic fact-checking has credibility issues), it is important to try to save this effort and to avoid wasting time on claims that have already been fact-checked. Interestingly, despite the importance of the task, it has been largely ignored by the research community so far. Here, we aim to bridge this gap. In particular, we formulate the task and we discuss how it relates to, but also differs from, previous work. We further create a specialized dataset, which we release to the research community. Finally, we present learning-to-rank experiments that demonstrate sizable improvements over state-of-the-art retrieval and textual similarity approaches.

Anthology ID:: 2020.acl-main.332
Volume:: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
Month:: July
Year:: 2020
Address:: Online
Editors:: Dan Jurafsky, Joyce Chai, Natalie Schluter, Joel Tetreault
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 3607–3618
Language:
URL:: https://aclanthology.org/2020.acl-main.332
DOI:: 10.18653/v1/2020.acl-main.332
Bibkey:
Cite (ACL):: Shaden Shaar, Nikolay Babulkov, Giovanni Da San Martino, and Preslav Nakov. 2020. That is a Known Lie: Detecting Previously Fact-Checked Claims. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 3607–3618, Online. Association for Computational Linguistics.
Cite (Informal):: That is a Known Lie: Detecting Previously Fact-Checked Claims (Shaar et al., ACL 2020)
Copy Citation:
PDF:: https://preview.aclanthology.org/nschneid-patch-5/2020.acl-main.332.pdf
Video:: http://slideslive.com/38929333
Code: sshaar/That-is-a-Known-Lie + additional community code
Data: GLUE

PDF Search Code Video