Abstract
In this paper, we present the results obtained using bi-directional long short-term memory (BiLSTM) models, with and without attention, and Logistic Regression (LR) models for SemEval-2019 Task 5, titled "HatEval: Multilingual Detection of Hate Speech Against Immigrants and Women in Twitter". The results are for Subtask A in the English language. We compare the BiLSTM and LR models under two types of preprocessing: one with no stemming and no stopword removal, and the other with stemming performed and stopwords removed. The BiLSTM model without attention performed best with the first type of preprocessing, while the LR model with character n-grams performed best with the second. The BiLSTM model obtained an F1 score of 0.51 on the test set and an official ranking of 8/71.
- Anthology ID:
- S19-2065
- Volume:
- Proceedings of the 13th International Workshop on Semantic Evaluation
- Month:
- June
- Year:
- 2019
- Address:
- Minneapolis, Minnesota, USA
- Editors:
- Jonathan May, Ekaterina Shutova, Aurelie Herbelot, Xiaodan Zhu, Marianna Apidianaki, Saif M. Mohammad
- Venue:
- SemEval
- SIG:
- SIGLEX
- Publisher:
- Association for Computational Linguistics
- Pages:
- 371–376
- URL:
- https://aclanthology.org/S19-2065
- DOI:
- 10.18653/v1/S19-2065
- Cite (ACL):
- Arup Baruah, Ferdous Barbhuiya, and Kuntal Dey. 2019. ABARUAH at SemEval-2019 Task 5 : Bi-directional LSTM for Hate Speech Detection. In Proceedings of the 13th International Workshop on Semantic Evaluation, pages 371–376, Minneapolis, Minnesota, USA. Association for Computational Linguistics.
- Cite (Informal):
- ABARUAH at SemEval-2019 Task 5 : Bi-directional LSTM for Hate Speech Detection (Baruah et al., SemEval 2019)
- PDF:
- https://preview.aclanthology.org/autopr/S19-2065.pdf
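
The abstract mentions character n-gram features for the LR model but does not give the details. As a minimal sketch of what such features can look like, the following counts character n-grams in a tweet. The function name `char_ngrams`, the n-gram range of 2 to 4, and the lowercasing step are assumptions made for illustration; they are not taken from the paper.

```python
from collections import Counter


def char_ngrams(text: str, n_min: int = 2, n_max: int = 4) -> Counter:
    """Count every character n-gram of length n_min..n_max in the text.

    Hypothetical helper: the range and the lowercasing are illustrative
    assumptions, not the settings used by the authors.
    """
    text = text.lower()
    counts = Counter()
    for n in range(n_min, n_max + 1):
        # Slide a window of width n over the string, one character at a time.
        for i in range(len(text) - n + 1):
            counts[text[i:i + n]] += 1
    return counts


# Bigram and trigram counts for a short example string.
features = char_ngrams("hate speech", n_min=2, n_max=3)
```

Counts of this kind would typically be turned into a sparse feature vector before being passed to a linear classifier such as logistic regression.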