Transformer-Based Medical Statement Classification in Doctor-Patient Dialogues

Farnod Bahrololloomi, Johannes Luderschmidt, Biying Fu


Abstract
The classification of medical statements in German doctor-patient interactions presents significant challenges for automated medical information extraction, particularly due to complex domain-specific terminology and the limited availability of specialized training data. To address this, we introduce a manually annotated dataset specifically designed for distinguishing medical from non-medical statements. This dataset incorporates the nuances of German medical terminology and provides a valuable foundation for further research in this domain. We systematically evaluate Transformer-based models and multimodal embedding techniques, comparing them against traditional embedding-based machine learning (ML) approaches and domain-specific models such as medBERT.de. Our empirical results show that Transformer-based architectures, such as the Sentence-BERT model combined with a support vector machine (SVM), achieve the highest accuracy of 79.58% and a weighted F1-Score of 78.81%, demonstrating an average performance improvement of up to 10% over domain-specific counterparts. Additionally, we highlight the potential of lightweight ML-models for resource-efficient deployment on mobile devices, enabling real-time medical information processing in practical settings. These findings emphasize the importance of embedding selection for optimizing classification performance in the medical domain and establish a robust foundation for the development of advanced, domain-adapted German language models.
Anthology ID:
2025.bionlp-1.7
Volume:
ACL 2025
Month:
August
Year:
2025
Address:
Viena, Austria
Editors:
Dina Demner-Fushman, Sophia Ananiadou, Makoto Miwa, Junichi Tsujii
Venues:
BioNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
63–73
Language:
URL:
https://preview.aclanthology.org/acl25-workshop-ingestion/2025.bionlp-1.7/
DOI:
Bibkey:
Cite (ACL):
Farnod Bahrololloomi, Johannes Luderschmidt, and Biying Fu. 2025. Transformer-Based Medical Statement Classification in Doctor-Patient Dialogues. In ACL 2025, pages 63–73, Viena, Austria. Association for Computational Linguistics.
Cite (Informal):
Transformer-Based Medical Statement Classification in Doctor-Patient Dialogues (Bahrololloomi et al., BioNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/acl25-workshop-ingestion/2025.bionlp-1.7.pdf
Supplementarymaterial:
 2025.bionlp-1.7.SupplementaryMaterial.zip
Supplementarymaterial:
 2025.bionlp-1.7.SupplementaryMaterial.txt