Neural Topic Modeling by Incorporating Document Relationship Graph

Deyu Zhou, Xuemeng Hu, Rui Wang


Abstract
Graph Neural Networks (GNNs) that capture the relationships between graph nodes via message passing have been a hot research direction in the natural language processing community. In this paper, we propose Graph Topic Model (GTM), a GNN based neural topic model that represents a corpus as a document relationship graph. Documents and words in the corpus become nodes in the graph and are connected based on document-word co-occurrences. By introducing the graph structure, the relationships between documents are established through their shared words and thus the topical representation of a document is enriched by aggregating information from its neighboring nodes using graph convolution. Extensive experiments on three datasets were conducted and the results demonstrate the effectiveness of the proposed approach.
Anthology ID:
2020.emnlp-main.310
Volume:
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Month:
November
Year:
2020
Address:
Online
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
3790–3796
Language:
URL:
https://aclanthology.org/2020.emnlp-main.310
DOI:
10.18653/v1/2020.emnlp-main.310
Bibkey:
Cite (ACL):
Deyu Zhou, Xuemeng Hu, and Rui Wang. 2020. Neural Topic Modeling by Incorporating Document Relationship Graph. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 3790–3796, Online. Association for Computational Linguistics.
Cite (Informal):
Neural Topic Modeling by Incorporating Document Relationship Graph (Zhou et al., EMNLP 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/update-css-js/2020.emnlp-main.310.pdf
Video:
 https://slideslive.com/38939189