Su Somay

2025

Standardized patients (SPs) are essential for clinical reasoning assessments in medical education. This paper introduces evaluation metrics that apply to both human and simulated SP systems. The metrics are computed using two LLM-as-a-judge approaches that align with human evaluators on SP performance, enabling scalable formative clinical reasoning assessments.

Co-authors

Andrew Emerson 1
Keelan Evanini 1
Kevin Frome 1
Le An Ha 1
Polina Harik 1

Victoria Yaneva 1

Venues

aimecon 1
