Linguistic Blind Spots in Clinical Decision Extraction

Mohamed Elgaar, Hadi Amiri


Abstract
Extracting medical decisions from clinical notes is a key step for clinical decision support and patient-facing care summaries. We study how the linguistic characteristics of clinical decisions vary across decision categories and whether these differences explain extraction failures. Using MedDec discharge summaries annotated with decision categories from the Decision Identification and Classification Taxonomy for Use in Medicine (DICTUM), we compute seven linguistic indices for each decision span and analyze span-level extraction recall of a standard transformer model. We find clear category-specific signatures: drug-related and problem-defining decisions are entity-dense and telegraphic, whereas advice and precaution decisions contain more narrative, with higher stopword and pronoun proportions and more frequent hedging and negation cues. On the validation split, exact-match recall is 48%, with large gaps across linguistic strata: recall drops from 58% to 24% from the lowest to highest stopword-proportion bins, and spans containing hedging or negation cues are less likely to be recovered. Under a relaxed overlap-based match criterion, recall increases to 71%, indicating that many errors are span boundary disagreements rather than complete misses. Overall, narrative-style spans–common in advice and precaution decisions–are a consistent blind spot under exact matching, suggesting that downstream systems should incorporate boundary-tolerant evaluation and extraction strategies for clinical decisions.
Anthology ID:
2026.healing-1.4
Volume:
Proceedings of the 1st Workshop on Linguistic Analysis for Health (HeaLing 2026)
Month:
March
Year:
2026
Address:
Rabat, Morocco
Editors:
Vera Danilova, Murathan Kurfalı, Ylva Söderfeldt, Julia Reed, Andrew Burchell
Venues:
HeaLing | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
46–54
Language:
URL:
https://preview.aclanthology.org/ingest-eacl/2026.healing-1.4/
DOI:
Bibkey:
Cite (ACL):
Mohamed Elgaar and Hadi Amiri. 2026. Linguistic Blind Spots in Clinical Decision Extraction. In Proceedings of the 1st Workshop on Linguistic Analysis for Health (HeaLing 2026), pages 46–54, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):
Linguistic Blind Spots in Clinical Decision Extraction (Elgaar & Amiri, HeaLing 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-eacl/2026.healing-1.4.pdf