An Ensemble Method with Sentiment Features and Clustering Support

Huy Tien Nguyen, Minh Le Nguyen


Abstract
Deep learning models have recently been applied successfully in natural language processing, especially sentiment analysis. Each deep learning model has a particular advantage, but it is difficult to combine these advantages into one model, especially in the area of sentiment analysis. In our approach, Convolutional Neural Network (CNN) and Long Short Term Memory (LSTM) were utilized to learn sentiment-specific features in a freezing scheme. This scenario provides a novel and efficient way for integrating advantages of deep learning models. In addition, we also grouped documents into clusters by their similarity and applied the prediction score of Naive Bayes SVM (NBSVM) method to boost the classification accuracy of each group. The experiments show that our method achieves the state-of-the-art performance on two well-known datasets: IMDB large movie reviews for document level and Pang & Lee movie reviews for sentence level.
Anthology ID:
I17-1065
Volume:
Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
Month:
November
Year:
2017
Address:
Taipei, Taiwan
Editors:
Greg Kondrak, Taro Watanabe
Venue:
IJCNLP
SIG:
Publisher:
Asian Federation of Natural Language Processing
Note:
Pages:
644–653
Language:
URL:
https://aclanthology.org/I17-1065
DOI:
Bibkey:
Cite (ACL):
Huy Tien Nguyen and Minh Le Nguyen. 2017. An Ensemble Method with Sentiment Features and Clustering Support. In Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 644–653, Taipei, Taiwan. Asian Federation of Natural Language Processing.
Cite (Informal):
An Ensemble Method with Sentiment Features and Clustering Support (Nguyen & Nguyen, IJCNLP 2017)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-3/I17-1065.pdf
Data
IMDb Movie Reviews