When Demographic Sensitivity Isn’t What It Seems: Baseline-Aware Counterfactual Audits for Clinical NLP

Hyunwoo Yoo


Abstract
Clinical NLP systems are increasingly used for triage support, prediction, and decision assistance in EHR-based settings, where demographic fairness is a critical concern. A common evaluation approach is counterfactual demographic perturbation: modifying attributes such as age or sex while holding clinical evidence fixed and measuring output changes. However, we show that such counterfactual audits can be misleading when interpreted in isolation. Across three clinical LLMs, we find that non-demographic control perturbations (e.g., paraphrases) often induce output variability comparable to or greater than demographic edits. This can contribute to overestimation or misinterpretation of demographic bias.To address this, we propose a baseline-aware audit framework that explicitly compares demographic perturbations against control baselines. Our analysis reveals that (i) label-level stability can mask substantial variation in generated rationales and recommendations, and (ii) age-based perturbations generally induce larger effects than sex-based ones in borderline cases. Crucially, we identify a high intrinsic instability ("noise floor"; 0.46–0.71 Jaccard instability) in clinical LLM generations, while additional matched-metric analyses show that demographic perturbations are often comparable to non-demographic baseline variability.These findings highlight a key limitation of existing fairness evaluations: without establishing appropriate baselines, apparent demographic sensitivity may be over- or mis-attributed to bias rather than broader generative instability. We argue that baseline-aware counterfactual audits, which explicitly compare demographic effects against intrinsic model noise, provide a more reliable lens for evaluating clinical NLP systems in high-stakes settings.
Anthology ID:
2026.bionlp-1.14
Volume:
BioNLP 2026
Month:
July
Year:
2026
Address:
San Diego, California
Editors:
Dina Demner-Fushman, Sophia Ananiadou, Kirk Roberts, Junichi Tsujii
Venues:
BioNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
141–155
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.bionlp-1.14/
DOI:
Bibkey:
Cite (ACL):
Hyunwoo Yoo. 2026. When Demographic Sensitivity Isn’t What It Seems: Baseline-Aware Counterfactual Audits for Clinical NLP. In BioNLP 2026, pages 141–155, San Diego, California. Association for Computational Linguistics.
Cite (Informal):
When Demographic Sensitivity Isn’t What It Seems: Baseline-Aware Counterfactual Audits for Clinical NLP (Yoo, BioNLP 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.bionlp-1.14.pdf