SpecialtyScribe: Enhancing SOAP note Scribing for Medical Specialties using LLM’s
Sagar Goyal, Eti Rastogi, Fen Zhao, Dong Yuan, Andrew Beinstein
Abstract
The healthcare industry has accumulated vast amounts of clinical data, much of which has traditionally been unstructured, including medical records, clinical data, patient communications, and visit notes. Clinician-patient conversations form a crucial part of medical records, with the resulting medical note serving as the ground truth for future interactions and treatment plans. Generating concise and accurate SOAP notes is critical for quality patient care and is especially challenging in specialty care, where relevance, clarity, and adherence to clinician preferences are paramount. These requirements make general-purpose LLMs unsuitable for producing high-quality specialty notes. While recent LLMs like GPT-4 and Sonnet 3.5 have shown promise, their high cost, size, latency, and privacy issues remain barriers for many healthcare providers.We introduce SpecialtyScribe, a modular pipeline for generating specialty-specific medical notes. It features three components: an Information Extractor to capture relevant data, a Context Retriever to verify and augment content from transcripts, and a Note Writer to produce high quality notes. Our framework and in-house models outperform similarly sized open-source models by over 12% on ROUGE metrics.Additionally, these models match top closed-source LLMs’ performance while being under 1% of their size. We specifically evaluate our framework for oncology, with the potential for adaptation to other specialties.- Anthology ID:
- 2025.cl4health-1.4
- Volume:
- Proceedings of the Second Workshop on Patient-Oriented Language Processing (CL4Health)
- Month:
- May
- Year:
- 2025
- Address:
- Albuquerque, New Mexico
- Editors:
- Sophia Ananiadou, Dina Demner-Fushman, Deepak Gupta, Paul Thompson
- Venues:
- CL4Health | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 34–45
- Language:
- URL:
- https://preview.aclanthology.org/fix-sig-urls/2025.cl4health-1.4/
- DOI:
- Cite (ACL):
- Sagar Goyal, Eti Rastogi, Fen Zhao, Dong Yuan, and Andrew Beinstein. 2025. SpecialtyScribe: Enhancing SOAP note Scribing for Medical Specialties using LLM’s. In Proceedings of the Second Workshop on Patient-Oriented Language Processing (CL4Health), pages 34–45, Albuquerque, New Mexico. Association for Computational Linguistics.
- Cite (Informal):
- SpecialtyScribe: Enhancing SOAP note Scribing for Medical Specialties using LLM’s (Goyal et al., CL4Health 2025)
- PDF:
- https://preview.aclanthology.org/fix-sig-urls/2025.cl4health-1.4.pdf