Abstract
Capsule networks have been shown to demonstrate good performance on structured data in the area of visual inference. In this paper we apply and compare simple shallow capsule networks for hierarchical multi-label text classification and show that they can perform superior to other neural networks, such as CNNs and LSTMs, and non-neural network architectures such as SVMs. For our experiments, we use the established Web of Science (WOS) dataset and introduce a new real-world scenario dataset, the BlurbGenreCollection (BGC). Our results confirm the hypothesis that capsule networks are especially advantageous for rare events and structurally diverse categories, which we attribute to their ability to combine latent encoded information.- Anthology ID:
- P19-2045
- Volume:
- Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop
- Month:
- July
- Year:
- 2019
- Address:
- Florence, Italy
- Editors:
- Fernando Alva-Manchego, Eunsol Choi, Daniel Khashabi
- Venue:
- ACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 323–330
- Language:
- URL:
- https://aclanthology.org/P19-2045
- DOI:
- 10.18653/v1/P19-2045
- Cite (ACL):
- Rami Aly, Steffen Remus, and Chris Biemann. 2019. Hierarchical Multi-label Classification of Text with Capsule Networks. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, pages 323–330, Florence, Italy. Association for Computational Linguistics.
- Cite (Informal):
- Hierarchical Multi-label Classification of Text with Capsule Networks (Aly et al., ACL 2019)
- PDF:
- https://preview.aclanthology.org/ml4al-ingestion/P19-2045.pdf
- Code
- uhh-lt/BlurbGenreCollection-HMC
- Data
- WOS