Reading Between the Lines: Toward Translating Verbose Patient-authored Messages into Clinician-Formulated Questions

Sarvesh Soni, Madeline Bittner, Dina Demner-Fushman


Abstract
Patient portal messages often embed clinical questions inside long, emotionally nuanced narratives, requiring clinicians to infer the underlying information need. We study the task of rewriting verbose patient-authored narratives into concise, clinician-interpreted questions framed as if querying an electronic health record (EHR) system. We evaluate a lightweight LLM-based rewrite pipeline that constrains outputs to 10-15 words and uses rule-based validation with regeneration. We test the approach on 140 distinct patient questions drawn from the ArchEHR-QA dataset and shared task. Each system output is double-annotated by two annotators for quality (Good/Ok/Bad) and error types (Generic, Malformed, Tangential, Hallucination). Results show that while models follow output constraints, they often produce overly generic or tangential questions, and occasional hallucinations introduce unsupported clinical details. Across both clinician-question and patient-narrative comparison settings, automatic metrics show substantial overlap across human quality labels; in pairwise meta-evaluation, BERTScore is the strongest proxy for human preferences. We release our code and annotations to support future work.
Anthology ID:
2026.bionlp-1.38
Volume:
BioNLP 2026
Month:
July
Year:
2026
Address:
San Diego, California
Editors:
Dina Demner-Fushman, Sophia Ananiadou, Kirk Roberts, Junichi Tsujii
Venues:
BioNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
481–489
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.bionlp-1.38/
DOI:
Bibkey:
Cite (ACL):
Sarvesh Soni, Madeline Bittner, and Dina Demner-Fushman. 2026. Reading Between the Lines: Toward Translating Verbose Patient-authored Messages into Clinician-Formulated Questions. In BioNLP 2026, pages 481–489, San Diego, California. Association for Computational Linguistics.
Cite (Informal):
Reading Between the Lines: Toward Translating Verbose Patient-authored Messages into Clinician-Formulated Questions (Soni et al., BioNLP 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.bionlp-1.38.pdf