An Efficient BERT Based Approach to Detect Aggression and Misogyny

Sandip Dutta, Utso Majumder, Sudip Naskar


Abstract
Social media is bustling with ever growing cases of trolling, aggression and hate. A huge amount of social media data is generated each day which is insurmountable for manual inspection. In this work, we propose an efficient and fast method to detect aggression and misogyny in social media texts. We use data from the Second Workshop on Trolling, Aggression and Cyber Bullying for our task. We employ a BERT based model to augment our data. Next we employ Tf-Idf and XGBoost for detecting aggression and misogyny. Our model achieves 0.73 and 0.85 Weighted F1 Scores on the 2 prediction tasks, which are comparable to the state of the art. However, the training time, model size and resource requirements of our model are drastically lower compared to the state of the art models, making our model useful for fast inference.
Anthology ID:
2021.icon-main.60
Volume:
Proceedings of the 18th International Conference on Natural Language Processing (ICON)
Month:
December
Year:
2021
Address:
National Institute of Technology Silchar, Silchar, India
Editors:
Sivaji Bandyopadhyay, Sobha Lalitha Devi, Pushpak Bhattacharyya
Venue:
ICON
SIG:
Publisher:
NLP Association of India (NLPAI)
Note:
Pages:
493–498
Language:
URL:
https://aclanthology.org/2021.icon-main.60
DOI:
Bibkey:
Cite (ACL):
Sandip Dutta, Utso Majumder, and Sudip Naskar. 2021. An Efficient BERT Based Approach to Detect Aggression and Misogyny. In Proceedings of the 18th International Conference on Natural Language Processing (ICON), pages 493–498, National Institute of Technology Silchar, Silchar, India. NLP Association of India (NLPAI).
Cite (Informal):
An Efficient BERT Based Approach to Detect Aggression and Misogyny (Dutta et al., ICON 2021)
Copy Citation:
PDF:
https://preview.aclanthology.org/emnlp22-frontmatter/2021.icon-main.60.pdf