Emotion Classification in a Resource Constrained Language Using Transformer-based Approach
Avishek Das, Omar Sharif, Mohammed Moshiul Hoque, Iqbal H. Sarker
Abstract
Although research on emotion classification has significantly progressed in high-resource languages, it is still infancy for resource-constrained languages like Bengali. However, unavailability of necessary language processing tools and deficiency of benchmark corpora makes the emotion classification task in Bengali more challenging and complicated. This work proposes a transformer-based technique to classify the Bengali text into one of the six basic emotions: anger, fear, disgust, sadness, joy, and surprise. A Bengali emotion corpus consists of 6243 texts is developed for the classification task. Experimentation carried out using various machine learning (LR, RF, MNB, SVM), deep neural networks (CNN, BiLSTM, CNN+BiLSTM) and transformer (Bangla-BERT, m-BERT, XLM-R) based approaches. Experimental outcomes indicate that XLM-R outdoes all other techniques by achieving the highest weighted f_1-score of 69.73% on the test data.- Anthology ID:
- 2021.naacl-srw.19
- Volume:
- Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop
- Month:
- June
- Year:
- 2021
- Address:
- Online
- Editors:
- Esin Durmus, Vivek Gupta, Nelson Liu, Nanyun Peng, Yu Su
- Venue:
- NAACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 150–158
- Language:
- URL:
- https://aclanthology.org/2021.naacl-srw.19
- DOI:
- 10.18653/v1/2021.naacl-srw.19
- Cite (ACL):
- Avishek Das, Omar Sharif, Mohammed Moshiul Hoque, and Iqbal H. Sarker. 2021. Emotion Classification in a Resource Constrained Language Using Transformer-based Approach. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop, pages 150–158, Online. Association for Computational Linguistics.
- Cite (Informal):
- Emotion Classification in a Resource Constrained Language Using Transformer-based Approach (Das et al., NAACL 2021)
- PDF:
- https://preview.aclanthology.org/ml4al-ingestion/2021.naacl-srw.19.pdf
- Code
- sagorbrur/bangla-bert + additional community code