Cluster-Gated Convolutional Neural Network for Short Text Classification

Haidong Zhang, Wancheng Ni, Meijing Zhao, Ziqi Lin


Abstract
Text classification plays a crucial role for understanding natural language in a wide range of applications. Most existing approaches mainly focus on long text classification (e.g., blogs, documents, paragraphs). However, they cannot easily be applied to short text because of its sparsity and lack of context. In this paper, we propose a new model called cluster-gated convolutional neural network (CGCNN), which jointly explores word-level clustering and text classification in an end-to-end manner. Specifically, the proposed model firstly uses a bi-directional long short-term memory to learn word representations. Then, it leverages a soft clustering method to explore their semantic relation with the cluster centers, and takes linear transformation on text representations. It develops a cluster-dependent gated convolutional layer to further control the cluster-dependent feature flows. Experimental results on five commonly used datasets show that our model outperforms state-of-the-art models.
Anthology ID:
K19-1094
Volume:
Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL)
Month:
November
Year:
2019
Address:
Hong Kong, China
Editors:
Mohit Bansal, Aline Villavicencio
Venue:
CoNLL
SIG:
SIGNLL
Publisher:
Association for Computational Linguistics
Note:
Pages:
1002–1011
Language:
URL:
https://aclanthology.org/K19-1094
DOI:
10.18653/v1/K19-1094
Bibkey:
Cite (ACL):
Haidong Zhang, Wancheng Ni, Meijing Zhao, and Ziqi Lin. 2019. Cluster-Gated Convolutional Neural Network for Short Text Classification. In Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL), pages 1002–1011, Hong Kong, China. Association for Computational Linguistics.
Cite (Informal):
Cluster-Gated Convolutional Neural Network for Short Text Classification (Zhang et al., CoNLL 2019)
Copy Citation:
PDF:
https://preview.aclanthology.org/dois-2013-emnlp/K19-1094.pdf