Document Retrieval and Claim Verification to Mitigate COVID-19 Misinformation

Megha Sundriyal, Ganeshan Malhotra, Md Shad Akhtar, Shubhashis Sengupta, Andrew Fano, Tanmoy Chakraborty


Abstract
During the COVID-19 pandemic, the spread of misinformation on online social media has grown exponentially. Unverified bogus claims on these platforms regularly mislead people, leading them to believe in half-baked truths. The current vogue is to employ manual fact-checkers to verify claims to combat this avalanche of misinformation. However, establishing such claims’ veracity is becoming increasingly challenging, partly due to the plethora of information available, which is difficult to process manually. Thus, it becomes imperative to verify claims automatically without human interventions. To cope up with this issue, we propose an automated claim verification solution encompassing two steps – document retrieval and veracity prediction. For the retrieval module, we employ a hybrid search-based system with BM25 as a base retriever and experiment with recent state-of-the-art transformer-based models for re-ranking. Furthermore, we use a BART-based textual entailment architecture to authenticate the retrieved documents in the later step. We report experimental findings, demonstrating that our retrieval module outperforms the best baseline system by 10.32 NDCG@100 points. We escort a demonstration to assess the efficacy and impact of our suggested solution. As a byproduct of this study, we present an open-source, easily deployable, and user-friendly Python API that the community can adopt.
Anthology ID:
2022.constraint-1.8
Volume:
Proceedings of the Workshop on Combating Online Hostile Posts in Regional Languages during Emergency Situations
Month:
May
Year:
2022
Address:
Dublin, Ireland
Venue:
CONSTRAINT
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
66–74
Language:
URL:
https://aclanthology.org/2022.constraint-1.8
DOI:
10.18653/v1/2022.constraint-1.8
Bibkey:
Cite (ACL):
Megha Sundriyal, Ganeshan Malhotra, Md Shad Akhtar, Shubhashis Sengupta, Andrew Fano, and Tanmoy Chakraborty. 2022. Document Retrieval and Claim Verification to Mitigate COVID-19 Misinformation. In Proceedings of the Workshop on Combating Online Hostile Posts in Regional Languages during Emergency Situations, pages 66–74, Dublin, Ireland. Association for Computational Linguistics.
Cite (Informal):
Document Retrieval and Claim Verification to Mitigate COVID-19 Misinformation (Sundriyal et al., CONSTRAINT 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/auto-file-uploads/2022.constraint-1.8.pdf
Video:
 https://preview.aclanthology.org/auto-file-uploads/2022.constraint-1.8.mp4
Data
CORD-19FEVER