Abstract
Text classification plays a crucial role for understanding natural language in a wide range of applications. Most existing approaches mainly focus on long text classification (e.g., blogs, documents, paragraphs). However, they cannot easily be applied to short text because of its sparsity and lack of context. In this paper, we propose a new model called cluster-gated convolutional neural network (CGCNN), which jointly explores word-level clustering and text classification in an end-to-end manner. Specifically, the proposed model firstly uses a bi-directional long short-term memory to learn word representations. Then, it leverages a soft clustering method to explore their semantic relation with the cluster centers, and takes linear transformation on text representations. It develops a cluster-dependent gated convolutional layer to further control the cluster-dependent feature flows. Experimental results on five commonly used datasets show that our model outperforms state-of-the-art models.- Anthology ID:
- K19-1094
- Volume:
- Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL)
- Month:
- November
- Year:
- 2019
- Address:
- Hong Kong, China
- Editors:
- Mohit Bansal, Aline Villavicencio
- Venue:
- CoNLL
- SIG:
- SIGNLL
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 1002–1011
- Language:
- URL:
- https://aclanthology.org/K19-1094
- DOI:
- 10.18653/v1/K19-1094
- Cite (ACL):
- Haidong Zhang, Wancheng Ni, Meijing Zhao, and Ziqi Lin. 2019. Cluster-Gated Convolutional Neural Network for Short Text Classification. In Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL), pages 1002–1011, Hong Kong, China. Association for Computational Linguistics.
- Cite (Informal):
- Cluster-Gated Convolutional Neural Network for Short Text Classification (Zhang et al., CoNLL 2019)
- PDF:
- https://preview.aclanthology.org/dois-2013-emnlp/K19-1094.pdf