Abstract
Offensive language has become pervasive in social media. In Offensive Language Identification tasks, it can be difficult to predict accurately from the surface words alone, so we try to extract deeper semantic information from the text. This paper presents an attention-based two-layer bidirectional long short-term memory neural network (BiLSTM) for semantic feature extraction. Additionally, a residual connection mechanism is used to synthesize two different deep features, and an emoji attention mechanism is used to extract the semantic information of emojis in the text. We participated in three sub-tasks of SemEval 2019 Task 6 as the CN-HIT-MI.T team. Our macro-averaged F1-score in sub-task A is 0.768, ranking 28/103. We got 0.638 in sub-task B, ranking 30/75. In sub-task C, we got 0.549, ranking 22/65. We also tried some other methods whose results we did not submit.
- Anthology ID:
- S19-2101
- Volume:
- Proceedings of the 13th International Workshop on Semantic Evaluation
- Month:
- June
- Year:
- 2019
- Address:
- Minneapolis, Minnesota, USA
- Venue:
- SemEval
- SIG:
- SIGLEX
- Publisher:
- Association for Computational Linguistics
- Pages:
- 564–570
- URL:
- https://aclanthology.org/S19-2101
- DOI:
- 10.18653/v1/S19-2101
- Cite (ACL):
- Yaojie Zhang, Bing Xu, and Tiejun Zhao. 2019. CN-HIT-MI.T at SemEval-2019 Task 6: Offensive Language Identification Based on BiLSTM with Double Attention. In Proceedings of the 13th International Workshop on Semantic Evaluation, pages 564–570, Minneapolis, Minnesota, USA. Association for Computational Linguistics.
- Cite (Informal):
- CN-HIT-MI.T at SemEval-2019 Task 6: Offensive Language Identification Based on BiLSTM with Double Attention (Zhang et al., SemEval 2019)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/S19-2101.pdf
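
The abstract reports all three results as macro-averaged F1-scores. As a minimal sketch of how that metric is computed (per-class F1, then an unweighted mean over classes), the following uses a hypothetical helper `macro_f1` and made-up toy labels; the `OFF`/`NOT` label names follow sub-task A, but the data is purely illustrative and is not taken from the paper.

```python
from collections import Counter


def macro_f1(gold, pred):
    """Unweighted mean of per-class F1 over every label seen in gold or pred."""
    labels = sorted(set(gold) | set(pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, pred):
        if g == p:
            tp[g] += 1
        else:
            fp[p] += 1  # predicted p, but it was wrong
            fn[g] += 1  # missed the true label g

    scores = []
    for label in labels:
        predicted = tp[label] + fp[label]
        actual = tp[label] + fn[label]
        precision = tp[label] / predicted if predicted else 0.0
        recall = tp[label] / actual if actual else 0.0
        denom = precision + recall
        scores.append(2 * precision * recall / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy data (illustrative only, not from the paper).
gold = ["OFF", "NOT", "NOT", "OFF", "NOT", "NOT"]
pred = ["OFF", "NOT", "OFF", "NOT", "NOT", "NOT"]
print(macro_f1(gold, pred))
```

Because every class contributes equally to the mean, a minority class such as `OFF` weighs as much as the majority class, which is why macro-averaging is a common choice for imbalanced label sets.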