Supervised Neural Topic Modeling with Label Alignment

Ruihao Chen, Hegang Chen, Yuyin Lu, Yanghui Rao, Chunjiang Zhu


Abstract
Neural topic modeling is a scalable automated technique for text data mining. In various downstream tasks of topic modeling, it is preferred that the discovered topics well align with labels. However, due to the lack of guidance from labels, unsupervised neural topic models are less powerful in this situation. Existing supervised neural topic models often adopt a label-free prior to generate the latent document-topic distributions and use them to predict the labels and thus achieve label-topic alignment indirectly. Such a mechanism faces the following issues: 1) The label-free prior leads to topics blending the latent patterns of multiple labels; and 2) One is unable to intuitively identify the explicit relationships between labels and the discovered topics. To tackle these problems, we develop a novel supervised neural topic model which utilizes a chain-structured graphical model with a label-conditioned prior. Soft indicators are introduced to explicitly construct the label-topic relationships. To obtain well-organized label-topic relationships, we formalize an entropy-regularized optimal transport problem on the embedding space and model them as the transport plan. Moreover, our proposed method can be flexibly integrated with most existing unsupervised neural topic models. Experimental results on multiple datasets demonstrate that our model can greatly enhance the alignment between labels and topics while maintaining good topic quality.
Anthology ID:
2025.tacl-1.12
Volume:
Transactions of the Association for Computational Linguistics, Volume 13
Month:
Year:
2025
Address:
Cambridge, MA
Venue:
TACL
SIG:
Publisher:
MIT Press
Note:
Pages:
249–263
Language:
URL:
https://preview.aclanthology.org/add-iwsds-main-page/2025.tacl-1.12/
DOI:
10.1162/tacl_a_00738
Bibkey:
Cite (ACL):
Ruihao Chen, Hegang Chen, Yuyin Lu, Yanghui Rao, and Chunjiang Zhu. 2025. Supervised Neural Topic Modeling with Label Alignment. Transactions of the Association for Computational Linguistics, 13:249–263.
Cite (Informal):
Supervised Neural Topic Modeling with Label Alignment (Chen et al., TACL 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/add-iwsds-main-page/2025.tacl-1.12.pdf