Abstract
In this paper we present our contribution to SemEval-2018, a classifier for classifying multi-label emotions of Arabic and English tweets. We attempted “Affect in Tweets”, specifically Task E-c: Detecting Emotions (multi-label classification). Our method is based on preprocessing the tweets and creating word vectors combined with a self correction step to remove noise. We also make use of emotion specific thresholds. The final submission was selected upon the best performance achieved, selected when using a range of thresholds. Our system was evaluated on the Arabic and English datasets provided for the task by the competition organisers, where it ranked 2nd for the Arabic dataset (out of 14 entries) and 12th for the English dataset (out of 35 entries).- Anthology ID:
- S18-1030
- Volume:
- Proceedings of the 12th International Workshop on Semantic Evaluation
- Month:
- June
- Year:
- 2018
- Address:
- New Orleans, Louisiana
- Venues:
- SemEval | *SEM
- SIGs:
- SIGLEX | SIGSEM
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 200–204
- Language:
- URL:
- https://aclanthology.org/S18-1030
- DOI:
- 10.18653/v1/S18-1030
- Cite (ACL):
- Tariq Ahmad, Allan Ramsay, and Hanady Ahmed. 2018. CENTEMENT at SemEval-2018 Task 1: Classification of Tweets using Multiple Thresholds with Self-correction and Weighted Conditional Probabilities. In Proceedings of the 12th International Workshop on Semantic Evaluation, pages 200–204, New Orleans, Louisiana. Association for Computational Linguistics.
- Cite (Informal):
- CENTEMENT at SemEval-2018 Task 1: Classification of Tweets using Multiple Thresholds with Self-correction and Weighted Conditional Probabilities (Ahmad et al., SemEval-*SEM 2018)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/S18-1030.pdf