Eric Rudolph
2026
Transition-Matrix Regularization for Next Dialogue Act Prediction in Counselling Conversations
Eric Rudolph | Philipp Steigerwald | Jens Albrecht
Findings of the Association for Computational Linguistics: ACL 2026
Eric Rudolph | Philipp Steigerwald | Jens Albrecht
Findings of the Association for Computational Linguistics: ACL 2026
This paper studies how empirical dialogue-flow statistics can be incorporated into Next Dialogue Act Prediction (NDAP). A KL regularization term is proposed that aligns predicted act distributions with corpus-derived transition patterns. Evaluated on a 60-class German counselling taxonomy using 5-fold cross-validation, this improves macro-F1 by 9–42% relative depending on encoder and substantially improves dialogue-flow alignment. Cross-dataset validation on HOPE suggests that improvements transfer across languages and counselling domains. In systematic ablations across pretrained encoders and architectures, the findings indicate that transition regularization provides consistent gains and disproportionately benefits weaker baseline models. The results suggest that lightweight discourse-flow priors complement pretrained encoders, especially in fine-grained, data-sparse dialogue tasks.
Nürnberg NLP at PsyDefDetect: Multi-Axis Voter Ensembles for Psychological Defence Mechanism Classification
Philipp Steigerwald | Eric Rudolph | Jens Albrecht
Proceedings of the BioNLP 2026 (Shared Tasks)
Philipp Steigerwald | Eric Rudolph | Jens Albrecht
Proceedings of the BioNLP 2026 (Shared Tasks)
Detecting levels of psychological defence mechanisms in supportive conversations is inherently ambiguous. In the PsyDefDetect shared task at BioNLP 2026 the eight positive defence categories share surface language and differ only in pragmatic function and trained raters reach only moderate inter-annotator agreement. On such a task the decisive lever is not a stronger single model but error independence, since any single representation will waver on the overlapping defence boundaries. We translate this insight into a 9-voter ensemble spanning three orthogonal axes: class granularity (all nine classes for the gatekeeper, only the eight defence classes for the specialists), training method (generative and discriminative) and base model. The system reaches an F1 score of .420 on the hidden test set, placing first among 21 registered teams.