Abstract
This paper presents the system we developed for the shared task on violence inciting text detection in Bangla. We describe both the traditional and the more recent approaches we used to train our models. The proposed system classifies whether a given text contains a threat. We also study the impact of data augmentation when only a limited dataset is available. Our quantitative results show that fine-tuning a multilingual-e5-base model performed best on this task compared to other transformer-based architectures. We obtained a macro F1 of 68.11% on the test set, and our submission ranked 23rd on the shared task leaderboard.
- Anthology ID:
- 2023.banglalp-1.17
- Volume:
- Proceedings of the First Workshop on Bangla Language Processing (BLP-2023)
- Month:
- December
- Year:
- 2023
- Address:
- Singapore
- Editors:
- Firoj Alam, Sudipta Kar, Shammur Absar Chowdhury, Farig Sadeque, Ruhul Amin
- Venue:
- BanglaLP
- Publisher:
- Association for Computational Linguistics
- Pages:
- 163–167
- URL:
- https://aclanthology.org/2023.banglalp-1.17
- DOI:
- 10.18653/v1/2023.banglalp-1.17
- Cite (ACL):
- Saumajit Saha and Albert Nanda. 2023. BanglaNLP at BLP-2023 Task 1: Benchmarking different Transformer Models for Violence Inciting Text Detection in Bangla. In Proceedings of the First Workshop on Bangla Language Processing (BLP-2023), pages 163–167, Singapore. Association for Computational Linguistics.
- Cite (Informal):
- BanglaNLP at BLP-2023 Task 1: Benchmarking different Transformer Models for Violence Inciting Text Detection in Bangla (Saha & Nanda, BanglaLP 2023)
- PDF:
- https://preview.aclanthology.org/emnlp22-frontmatter/2023.banglalp-1.17.pdf