Multimodal or Text? Retrieval or BERT? Benchmarking Classifiers for the Shared Task on Hateful Memes
Abstract
The Shared Task on Hateful Memes is a challenge that aims at the detection of hateful content in memes by inviting the implementation of systems that understand memes, potentially by combining image and textual information. The challenge consists of three detection tasks: hate, protected category and attack type. The first is a binary classification task, while the other two are multi-label classification tasks. Our participation included a text-based BERT baseline (TxtBERT), the same but adding information from the image (ImgBERT), and neural retrieval approaches. We also experimented with retrieval augmented classification models. We found that an ensemble of TxtBERT and ImgBERT achieves the best performance in terms of ROC AUC score in two out of the three tasks on our development set.- Anthology ID:
- 2021.woah-1.24
- Volume:
- Proceedings of the 5th Workshop on Online Abuse and Harms (WOAH 2021)
- Month:
- August
- Year:
- 2021
- Address:
- Online
- Editors:
- Aida Mostafazadeh Davani, Douwe Kiela, Mathias Lambert, Bertie Vidgen, Vinodkumar Prabhakaran, Zeerak Waseem
- Venue:
- WOAH
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 220–225
- Language:
- URL:
- https://aclanthology.org/2021.woah-1.24
- DOI:
- 10.18653/v1/2021.woah-1.24
- Cite (ACL):
- Vasiliki Kougia and John Pavlopoulos. 2021. Multimodal or Text? Retrieval or BERT? Benchmarking Classifiers for the Shared Task on Hateful Memes. In Proceedings of the 5th Workshop on Online Abuse and Harms (WOAH 2021), pages 220–225, Online. Association for Computational Linguistics.
- Cite (Informal):
- Multimodal or Text? Retrieval or BERT? Benchmarking Classifiers for the Shared Task on Hateful Memes (Kougia & Pavlopoulos, WOAH 2021)
- PDF:
- https://preview.aclanthology.org/naacl-24-ws-corrections/2021.woah-1.24.pdf
- Data
- Hateful Memes, MS COCO