Transformer-Based Medical Statement Classification in Doctor-Patient Dialogues

Farnod Bahrololloomi; Johannes Luderschmidt; Biying Fu

Transformer-Based Medical Statement Classification in Doctor-Patient Dialogues

Farnod Bahrololloomi, Johannes Luderschmidt, Biying Fu

Abstract

The classification of medical statements in German doctor-patient interactions presents significant challenges for automated medical information extraction, particularly due to complex domain-specific terminology and the limited availability of specialized training data. To address this, we introduce a manually annotated dataset specifically designed for distinguishing medical from non-medical statements. This dataset incorporates the nuances of German medical terminology and provides a valuable foundation for further research in this domain. We systematically evaluate Transformer-based models and multimodal embedding techniques, comparing them against traditional embedding-based machine learning (ML) approaches and domain-specific models such as medBERT.de. Our empirical results show that Transformer-based architectures, such as the Sentence-BERT model combined with a support vector machine (SVM), achieve the highest accuracy of 79.58% and a weighted F1-Score of 78.81%, demonstrating an average performance improvement of up to 10% over domain-specific counterparts. Additionally, we highlight the potential of lightweight ML-models for resource-efficient deployment on mobile devices, enabling real-time medical information processing in practical settings. These findings emphasize the importance of embedding selection for optimizing classification performance in the medical domain and establish a robust foundation for the development of advanced, domain-adapted German language models.

Anthology ID:: 2025.bionlp-1.7
Volume:: ACL 2025
Month:: August
Year:: 2025
Address:: Viena, Austria
Editors:: Dina Demner-Fushman, Sophia Ananiadou, Makoto Miwa, Junichi Tsujii
Venues:: BioNLP | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 63–73
Language:
URL:: https://preview.aclanthology.org/acl25-workshop-ingestion/2025.bionlp-1.7/
DOI:
Bibkey:
Cite (ACL):: Farnod Bahrololloomi, Johannes Luderschmidt, and Biying Fu. 2025. Transformer-Based Medical Statement Classification in Doctor-Patient Dialogues. In ACL 2025, pages 63–73, Viena, Austria. Association for Computational Linguistics.
Cite (Informal):: Transformer-Based Medical Statement Classification in Doctor-Patient Dialogues (Bahrololloomi et al., BioNLP 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/acl25-workshop-ingestion/2025.bionlp-1.7.pdf
Supplementarymaterial:: 2025.bionlp-1.7.SupplementaryMaterial.zip
Supplementarymaterial:: 2025.bionlp-1.7.SupplementaryMaterial.txt

PDF Cite Search Supplementarymaterial Supplementarymaterial Fix data