TECHSSN at SemEval-2019 Task 6: Identifying and Categorizing Offensive Language in Tweets using Deep Neural Networks
Logesh Balasubramanian, Harshini Sathish Kumar, Geetika Bandlamudi, Dyaneswaran Sivasankaran, Rajalakshmi Sivanaiah, Angel Deborah Suseelan, Sakaya Milton Rajendram, Mirnalinee Thanka Nadar Thanagathai
Abstract
Task 6 of SemEval 2019 involves identifying and categorizing offensive language in social media. The systems developed by TECHSSN team uses multi-level classification techniques. We have developed two systems. In the first system, the first level of classification is done by a multi-branch 2D CNN classifier with Google’s pre-trained Word2Vec embedding and the second level of classification by string matching technique supported by offensive and bad words dictionary. The second system uses a multi-branch 1D CNN classifier with Glove pre-trained embedding layer for the first level of classification and string matching for the second level of classification. Input data with a probability of less than 0.70 in the first level are passed on to the second level. The misclassified examples are classified correctly in the second level.- Anthology ID:
- S19-2132
- Volume:
- Proceedings of the 13th International Workshop on Semantic Evaluation
- Month:
- June
- Year:
- 2019
- Address:
- Minneapolis, Minnesota, USA
- Editors:
- Jonathan May, Ekaterina Shutova, Aurelie Herbelot, Xiaodan Zhu, Marianna Apidianaki, Saif M. Mohammad
- Venue:
- SemEval
- SIG:
- SIGLEX
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 753–758
- Language:
- URL:
- https://preview.aclanthology.org/sigarab-more-entries-6621/S19-2132/
- DOI:
- 10.18653/v1/S19-2132
- Cite (ACL):
- Logesh Balasubramanian, Harshini Sathish Kumar, Geetika Bandlamudi, Dyaneswaran Sivasankaran, Rajalakshmi Sivanaiah, Angel Deborah Suseelan, Sakaya Milton Rajendram, and Mirnalinee Thanka Nadar Thanagathai. 2019. TECHSSN at SemEval-2019 Task 6: Identifying and Categorizing Offensive Language in Tweets using Deep Neural Networks. In Proceedings of the 13th International Workshop on Semantic Evaluation, pages 753–758, Minneapolis, Minnesota, USA. Association for Computational Linguistics.
- Cite (Informal):
- TECHSSN at SemEval-2019 Task 6: Identifying and Categorizing Offensive Language in Tweets using Deep Neural Networks (Balasubramanian et al., SemEval 2019)
- PDF:
- https://preview.aclanthology.org/sigarab-more-entries-6621/S19-2132.pdf