LLMs (Almost) Never Abstain Under Medical Uncertainty

Alessio Cocchieri; Luca Ragazzi; Giuseppe Tagliavini; Gianluca Moro

Abstract

Medical multiple-choice question answering (MCQA) benchmarks implicitly assume that large language models (LLMs) should always commit to an answer. However, in clinical practice, uncertainty is pervasive and abstaining is often the safest action. We introduce MedQAbstain, a benchmark explicitly designed to evaluate medical abstention under uncertainty. MedQAbstain repurposes standard medical MCQA datasets by removing the gold answer and introducing an explicit "I abstain" option, framed as a safety-critical decision with clinical consequences. The benchmark supports systematic analysis across abstention regimes, distractor complexity, and input modalities, and elicits self-reported model confidence to study calibration. Across all settings, we find that state-of-the-art LLMs systematically overcommit, rarely abstaining even when the question itself is hidden. These results reveal a fundamental mismatch between LLM behavior and clinical norms, highlighting abstention as a critical but overlooked dimension of medical decision-making evaluation.
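The construction described in the abstract (remove the gold answer, add an explicit "I abstain" option) can be illustrated with a minimal Python sketch. This is not the authors' code: the `MCQItem` type, the function names, the placement of the abstain option at the end of the list, and the placeholder option strings are all assumptions made for illustration.

```python
from dataclasses import dataclass

ABSTAIN = "I abstain"  # option wording quoted in the abstract


@dataclass
class MCQItem:
    """Hypothetical container for one multiple-choice question."""
    question: str
    options: list[str]
    gold_index: int  # position of the option treated as correct


def to_abstention_item(item: MCQItem) -> MCQItem:
    """Drop the gold option and append an abstain option.

    After the conversion no remaining answer is correct, so the
    abstain option (placed last here, by assumption) is the safe choice.
    """
    distractors = [o for i, o in enumerate(item.options) if i != item.gold_index]
    options = distractors + [ABSTAIN]
    return MCQItem(item.question, options, gold_index=len(options) - 1)


def abstention_rate(choices: list[int], items: list[MCQItem]) -> float:
    """Fraction of items on which the chosen option index is the abstain option."""
    if not items:
        return 0.0
    hits = sum(1 for c, it in zip(choices, items) if it.options[c] == ABSTAIN)
    return hits / len(items)


# Placeholder item; the option strings stand in for real answer choices.
original = MCQItem("placeholder question", ["opt A", "opt B", "opt C", "opt D"], gold_index=1)
converted = to_abstention_item(original)
```

Under these assumptions, a model that picks any of the remaining distractors on a converted item is overcommitting, and `abstention_rate` summarizes how often it chose to abstain instead.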

Anthology ID: 2026.acl-long.1365
Volume: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month: July
Year: 2026
Address: San Diego, California, United States
Editors: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue: ACL
Publisher: Association for Computational Linguistics
Pages: 29573–29613
URL: https://preview.aclanthology.org/ingest-acl/2026.acl-long.1365/
Cite (ACL): Alessio Cocchieri, Luca Ragazzi, Giuseppe Tagliavini, and Gianluca Moro. 2026. LLMs (Almost) Never Abstain Under Medical Uncertainty. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 29573–29613, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal): LLMs (Almost) Never Abstain Under Medical Uncertainty (Cocchieri et al., ACL 2026)
PDF: https://preview.aclanthology.org/ingest-acl/2026.acl-long.1365.pdf
Checklist: 2026.acl-long.1365.checklist.pdf
