Tobias Strapatsas


2025

pdf bib
One Size Fits None: Rethinking Fairness in Medical AI
Roland Roller | Michael Hahn | Ajay Madhavan Ravichandran | Bilgin Osmanodja | Florian Oetke | Zeineb Sassi | Aljoscha Burchardt | Klaus Netter | Klemens Budde | Anne Herrmann | Tobias Strapatsas | Peter Dabrock | Sebastian Möller
Proceedings of the 6th Workshop on Gender Bias in Natural Language Processing (GeBNLP)

Machine learning (ML) models are increasingly used to support clinical decision-making. However, real-world medical datasets are often noisy, incomplete, and imbalanced, leading to performance disparities across patient subgroups. These differences raise fairness concerns, particularly when they reinforce existing disadvantages for marginalized groups. In this work, we analyze several medical prediction tasks and demonstrate how model performance varies with patient characteristics. While ML models may demonstrate good overall performance, we argue that subgroup-level evaluation is essential before integrating them into clinical workflows. By conducting a performance analysis at the subgroup level, differences can be clearly identified—allowing, on the one hand, for performance disparities to be considered in clinical practice, and on the other hand, for these insights to inform the responsible development of more effective models. Thereby, our work contributes to a practical discussion around the subgroup-sensitive development and deployment of medical ML models and the interconnectedness of fairness and transparency.

2024

pdf bib
Towards ML-supported Triage Prediction in Real-World Emergency Room Scenarios
Faraz Maschhur | Klaus Netter | Sven Schmeier | Katrin Ostermann | Rimantas Palunis | Tobias Strapatsas | Roland Roller
Proceedings of the 23rd Workshop on Biomedical Natural Language Processing

In emergency wards, patients are prioritized by clinical staff according to the urgency of their medical condition. This can be achieved by categorizing patients into different labels of urgency ranging from immediate to not urgent. However, in order to train machine learning models offering support in this regard, there is more than approaching this as a multi-class problem. This work explores the challenges and obstacles of automatic triage using anonymized real-world multi-modal ambulance data in Germany.