Abstract
This paper introduces the first emotion-annotated dataset for the Dari variant of Persian spoken in Afghanistan. The LetHerLearn dataset contains 7,600 tweets posted in reaction to the Taliban’s ban of women’s rights to education in 2022 and has been manually annotated according to Ekman’s emotion categories. We here detail the data collection and annotation process, present relevant dataset statistics as well as initial experiments on the resulting dataset, benchmarking a number of different neural architectures for the task of Dari emotion classification.- Anthology ID:
- 2023.wassa-1.24
- Volume:
- Proceedings of the 13th Workshop on Computational Approaches to Subjectivity, Sentiment, & Social Media Analysis
- Month:
- July
- Year:
- 2023
- Address:
- Toronto, Canada
- Editors:
- Jeremy Barnes, Orphée De Clercq, Roman Klinger
- Venue:
- WASSA
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 271–277
- Language:
- URL:
- https://preview.aclanthology.org/add_missing_videos/2023.wassa-1.24/
- DOI:
- 10.18653/v1/2023.wassa-1.24
- Cite (ACL):
- Mohammad Ali Hussiny and Lilja Øvrelid. 2023. Emotion Analysis of Tweets Banning Education in Afghanistan. In Proceedings of the 13th Workshop on Computational Approaches to Subjectivity, Sentiment, & Social Media Analysis, pages 271–277, Toronto, Canada. Association for Computational Linguistics.
- Cite (Informal):
- Emotion Analysis of Tweets Banning Education in Afghanistan (Hussiny & Øvrelid, WASSA 2023)
- PDF:
- https://preview.aclanthology.org/add_missing_videos/2023.wassa-1.24.pdf