CN-HIT-MI.T at SemEval-2019 Task 6: Offensive Language Identification Based on BiLSTM with Double Attention

Yaojie Zhang, Bing Xu, Tiejun Zhao


Abstract
Offensive language has become pervasive on social media. In offensive language identification, it can be difficult to make accurate predictions from surface words alone, so we try to extract deeper semantic information from the text. This paper presents an attention-based two-layer bidirectional long short-term memory network (BiLSTM) for semantic feature extraction. Additionally, a residual connection mechanism is used to combine two different deep features, and an emoji attention mechanism is used to extract the semantic information of emojis in the text. We participated in the three sub-tasks of SemEval-2019 Task 6 as team CN-HIT-MI.T. Our macro-averaged F1-score is 0.768 in sub-task A (ranked 28/103), 0.638 in sub-task B (ranked 30/75), and 0.549 in sub-task C (ranked 22/65). We also tried some other methods for which we did not submit results.
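The scores above are macro-averaged F1 values. As a reminder of what that metric measures, here is a minimal sketch in plain Python. It is not the organizers' scoring script; the function name `macro_f1` and the toy labels are illustrative assumptions. The per-class F1 is computed for every label and then averaged with equal weight, so a rare class counts as much as a frequent one.

```python
from collections import Counter


def macro_f1(gold, pred):
    """Unweighted mean of per-class F1 (hypothetical helper, for illustration)."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")

    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, pred):
        if g == p:
            tp[g] += 1   # correct prediction for class g
        else:
            fp[p] += 1   # p was predicted but is wrong
            fn[g] += 1   # g was the true class but was missed

    labels = sorted(set(gold) | set(pred))
    scores = []
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        # F1 = 2TP / (2TP + FP + FN); defined as 0.0 when the class never occurs
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy example with made-up labels (not data from the paper).
gold = ["OFF", "NOT", "NOT", "OFF", "NOT"]
pred = ["OFF", "NOT", "OFF", "NOT", "NOT"]
score = macro_f1(gold, pred)
print(round(score, 4))
```

In the toy example the two classes have F1 values of 1/2 and 2/3, and the macro average is their plain mean, regardless of how many examples each class has.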
Anthology ID:
S19-2101
Volume:
Proceedings of the 13th International Workshop on Semantic Evaluation
Month:
June
Year:
2019
Address:
Minneapolis, Minnesota, USA
Venue:
SemEval
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Pages:
564–570
URL:
https://aclanthology.org/S19-2101
DOI:
10.18653/v1/S19-2101
Cite (ACL):
Yaojie Zhang, Bing Xu, and Tiejun Zhao. 2019. CN-HIT-MI.T at SemEval-2019 Task 6: Offensive Language Identification Based on BiLSTM with Double Attention. In Proceedings of the 13th International Workshop on Semantic Evaluation, pages 564–570, Minneapolis, Minnesota, USA. Association for Computational Linguistics.
Cite (Informal):
CN-HIT-MI.T at SemEval-2019 Task 6: Offensive Language Identification Based on BiLSTM with Double Attention (Zhang et al., SemEval 2019)
PDF:
https://preview.aclanthology.org/ingestion-script-update/S19-2101.pdf
Supplementary:
 S19-2101.Supplementary.zip