Jens Albrecht
2026
Transition-Matrix Regularization for Next Dialogue Act Prediction in Counselling Conversations
Eric Rudolph | Philipp Steigerwald | Jens Albrecht
Findings of the Association for Computational Linguistics: ACL 2026
Eric Rudolph | Philipp Steigerwald | Jens Albrecht
Findings of the Association for Computational Linguistics: ACL 2026
This paper studies how empirical dialogue-flow statistics can be incorporated into Next Dialogue Act Prediction (NDAP). A KL regularization term is proposed that aligns predicted act distributions with corpus-derived transition patterns. Evaluated on a 60-class German counselling taxonomy using 5-fold cross-validation, this improves macro-F1 by 9–42% relative depending on encoder and substantially improves dialogue-flow alignment. Cross-dataset validation on HOPE suggests that improvements transfer across languages and counselling domains. In systematic ablations across pretrained encoders and architectures, the findings indicate that transition regularization provides consistent gains and disproportionately benefits weaker baseline models. The results suggest that lightweight discourse-flow priors complement pretrained encoders, especially in fine-grained, data-sparse dialogue tasks.
2020
Using BERT for Qualitative Content Analysis in Psychosocial Online Counseling
Philipp Grandeit | Carolyn Haberkern | Maximiliane Lang | Jens Albrecht | Robert Lehmann
Proceedings of the Fourth Workshop on Natural Language Processing and Computational Social Science
Philipp Grandeit | Carolyn Haberkern | Maximiliane Lang | Jens Albrecht | Robert Lehmann
Proceedings of the Fourth Workshop on Natural Language Processing and Computational Social Science
Qualitative content analysis is a systematic method commonly used in the social sciences to analyze textual data from interviews or online discussions. However, this method usually requires high expertise and manual effort because human coders need to read, interpret, and manually annotate text passages. This is especially true if the system of categories used for annotation is complex and semantically rich. Therefore, qualitative content analysis could benefit greatly from automated coding. In this work, we investigate the usage of machine learning-based text classification models for automatic coding in the area of psycho-social online counseling. We developed a system of over 50 categories to analyze counseling conversations, labeled over 10.000 text passages manually, and evaluated the performance of different machine learning-based classifiers against human coders.