Amira Shoukry
2012
Preprocessing Egyptian Dialect Tweets for Sentiment Mining
Amira Shoukry
|
Ahmed Rafea
Fourth Workshop on Computational Approaches to Arabic-Script-based Languages
Research done on Arabic sentiment analysis is considered very limited almost in its early steps compared to other languages like English whether at document-level or sentence-level. In this paper, we test the effect of preprocessing (normalization, stemming, and stop words removal) on the performance of an Arabic sentiment analysis system using Arabic tweets from twitter. The sentiment (positive or negative) of the crawled tweets is analyzed to interpret the attitude of the public with regards to topic of interest. Using Twitter as the main source of data reflects the importance of the system for the Middle East region, which mostly speaks Arabic.