ASU_OPTO at OSACT4 - Offensive Language Detection for Arabic text

Amr Keleg, Samhaa R. El-Beltagy, Mahmoud Khalil


Abstract
In the past years, toxic comments and offensive speech are polluting the internet and manual inspection of these comments is becoming a tiresome task to manage. Having a machine learning based model that is able to filter offensive Arabic content is of high need nowadays. In this paper, we describe the model that was submitted to the Shared Task on Offensive Language Detection that is organized by (The 4th Workshop on Open-Source Arabic Corpora and Processing Tools). Our model makes use transformer based model (BERT) to detect offensive content. We came in the fourth place in subtask A (detecting Offensive Speech) and in the third place in subtask B (detecting Hate Speech).
Anthology ID:
2020.osact-1.10
Volume:
Proceedings of the 4th Workshop on Open-Source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection
Month:
May
Year:
2020
Address:
Marseille, France
Editors:
Hend Al-Khalifa, Walid Magdy, Kareem Darwish, Tamer Elsayed, Hamdy Mubarak
Venue:
OSACT
SIG:
SIGARAB
Publisher:
European Language Resource Association
Note:
Pages:
66–70
Language:
English
URL:
https://preview.aclanthology.org/icon-24-ingestion/2020.osact-1.10/
DOI:
Bibkey:
Cite (ACL):
Amr Keleg, Samhaa R. El-Beltagy, and Mahmoud Khalil. 2020. ASU_OPTO at OSACT4 - Offensive Language Detection for Arabic text. In Proceedings of the 4th Workshop on Open-Source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection, pages 66–70, Marseille, France. European Language Resource Association.
Cite (Informal):
ASU_OPTO at OSACT4 - Offensive Language Detection for Arabic text (Keleg et al., OSACT 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/icon-24-ingestion/2020.osact-1.10.pdf