Tw-StAR at SemEval-2018 Task 1: Preprocessing Impact on Multi-label Emotion Classification
Hala Mulki, Chedi Bechikh Ali, Hatem Haddad, Ismail Babaoğlu
Abstract
In this paper, we describe our contribution to the SemEval-2018 contest. We tackled Task 1, “Affect in Tweets”, subtask E-c, “Detecting Emotions (multi-label classification)”. A multi-label classification system, Tw-StAR, was developed to recognize the emotions embedded in Arabic, English and Spanish tweets. To handle the multi-label classification problem with traditional classifiers, we employed the binary relevance transformation strategy, while a TF-IDF scheme was used to generate the tweets’ features. We investigated single preprocessing tasks and combinations of several of them to further improve performance. The results showed that specific combinations of preprocessing tasks could significantly improve the evaluation measures. This was later reflected in the official results, as our system ranked 3rd for both the Arabic and Spanish datasets and 14th for the English dataset.
- Anthology ID:
- S18-1024
- Volume:
- Proceedings of the 12th International Workshop on Semantic Evaluation
- Month:
- June
- Year:
- 2018
- Address:
- New Orleans, Louisiana
- Venues:
- SemEval | *SEM
- SIGs:
- SIGLEX | SIGSEM
- Publisher:
- Association for Computational Linguistics
- Pages:
- 167–171
- URL:
- https://aclanthology.org/S18-1024
- DOI:
- 10.18653/v1/S18-1024
- Cite (ACL):
- Hala Mulki, Chedi Bechikh Ali, Hatem Haddad, and Ismail Babaoğlu. 2018. Tw-StAR at SemEval-2018 Task 1: Preprocessing Impact on Multi-label Emotion Classification. In Proceedings of the 12th International Workshop on Semantic Evaluation, pages 167–171, New Orleans, Louisiana. Association for Computational Linguistics.
- Cite (Informal):
- Tw-StAR at SemEval-2018 Task 1: Preprocessing Impact on Multi-label Emotion Classification (Mulki et al., SemEval-*SEM 2018)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/S18-1024.pdf
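
The pipeline named in the abstract (TF-IDF features combined with binary relevance, meaning one independent binary classifier per emotion label) can be illustrated with a minimal standard-library sketch. Everything specific here is an assumption made for illustration: the toy tweets and labels are invented, the function names are hypothetical, and the nearest-centroid classifier is only a stand-in, since the abstract says "traditional classifiers" without naming them.

```python
import math
from collections import Counter


def tokenize(text):
    # Lowercase whitespace split; a placeholder for real tweet preprocessing.
    return text.lower().split()


def fit_idf(docs):
    # Smoothed inverse document frequency: log((1 + N) / (1 + df)) + 1.
    n = len(docs)
    df = Counter(term for doc in docs for term in set(tokenize(doc)))
    return {term: math.log((1 + n) / (1 + count)) + 1.0 for term, count in df.items()}


def tfidf(doc, idf):
    # Sparse, L2-normalised TF-IDF vector; terms unseen in training are dropped.
    tf = Counter(t for t in tokenize(doc) if t in idf)
    vec = {t: c * idf[t] for t, c in tf.items()}
    norm = math.sqrt(sum(v * v for v in vec.values()))
    return {t: v / norm for t, v in vec.items()} if norm else {}


def centroid(vectors):
    # Mean of a list of sparse vectors.
    total = Counter()
    for vec in vectors:
        for t, v in vec.items():
            total[t] += v
    return {t: v / len(vectors) for t, v in total.items()} if vectors else {}


def dot(a, b):
    return sum(v * b.get(t, 0.0) for t, v in a.items())


def cosine(a, b):
    na, nb = math.sqrt(dot(a, a)), math.sqrt(dot(b, b))
    return dot(a, b) / (na * nb) if na and nb else 0.0


def fit_binary_relevance(docs, label_sets, labels):
    # Binary relevance: train one yes/no model per label, independently.
    # Each model here is a (positive centroid, negative centroid) pair,
    # an assumed stand-in for whatever classifier is actually used.
    idf = fit_idf(docs)
    vectors = [tfidf(d, idf) for d in docs]
    models = {}
    for label in labels:
        pos = [v for v, ls in zip(vectors, label_sets) if label in ls]
        neg = [v for v, ls in zip(vectors, label_sets) if label not in ls]
        models[label] = (centroid(pos), centroid(neg))
    return idf, models


def predict(doc, idf, models):
    # A label is assigned when the tweet is closer to that label's positive centroid.
    vec = tfidf(doc, idf)
    return {label for label, (pos, neg) in models.items()
            if cosine(vec, pos) > cosine(vec, neg)}


# Invented toy data, not taken from the shared-task datasets.
docs = [
    "so happy and excited today",
    "happy but also scared",
    "angry and furious about this",
    "scared and angry right now",
]
label_sets = [{"joy"}, {"joy", "fear"}, {"anger"}, {"fear", "anger"}]
idf, models = fit_binary_relevance(docs, label_sets, ["anger", "fear", "joy"])
print(predict("happy excited", idf, models))
print(predict("scared angry", idf, models))
```

Because each label gets its own model, a tweet can receive zero, one, or several emotions, which is what makes binary relevance a simple way to reuse single-label classifiers for a multi-label task.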