MedRedFlag: Investigating how LLMs Redirect Misconceptions in Real-World Health Communication

Sraavya Sambara; Yuan Pu; Ayman Ali; Vishala Mishra; Lionel Wong; Monica Agrawal

Abstract

Real-world health questions from patients often unintentionally embed false assumptions or premises. In such cases, safe medical communication typically involves redirection: addressing the implicit misconception and then responding to the underlying patient context, rather than the original question. While large language models (LLMs) are increasingly being used by lay users for medical advice, they have not yet been tested for this crucial competency. Therefore, in this work, we investigate how LLMs react to false premises embedded within real-world health questions. We develop a semi-automated pipeline to curate MedRedFlag, a dataset of 1100+ questions sourced from Reddit that require redirection. We then systematically compare responses from state-of-the-art LLMs to those from clinicians. Our analysis reveals that LLMs often fail to redirect problematic questions, even when the problematic premise is detected, and provide answers that could lead to suboptimal medical decision making. Our benchmark and results reveal a novel and substantial gap in how LLMs perform under the conditions of real-world health communication, highlighting critical safety concerns for patient-facing medical AI systems. Code and data are available at https://github.com/srsambara-1/MedRedFlag.

Anthology ID: 2026.findings-acl.1771
Volume: Findings of the Association for Computational Linguistics: ACL 2026
Month: July
Year: 2026
Address: San Diego, California, United States
Editors: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 35553–35578
URL: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1771/
Cite (ACL): Sraavya Sambara, Yuan Pu, Ayman Ali, Vishala Mishra, Lionel Wong, and Monica Agrawal. 2026. MedRedFlag: Investigating how LLMs Redirect Misconceptions in Real-World Health Communication. In Findings of the Association for Computational Linguistics: ACL 2026, pages 35553–35578, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal): MedRedFlag: Investigating how LLMs Redirect Misconceptions in Real-World Health Communication (Sambara et al., Findings 2026)
PDF: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1771.pdf
Checklist: 2026.findings-acl.1771.checklist.pdf