Truth or Dare: Analyzing LLM Susceptibility to External Evidence of Varying Factuality

Han-Yu Su, Kuan-Yu Chu, Yung-Hui Li, Lun-Wei Ku


Abstract
Modern Large Language Models (LLMs) often rely on Retrieval-Augmented Generation (RAG) to access up-to-date information; however, retrieved corpora may contain misleading, outdated, or incorrect content, raising concerns about how such evidence affects model reliability. In this work, we investigate the susceptibility of LLMs to false external evidence. Existing studies have shown that poisoned external corpora can mislead LLM responses; yet, there is still a lack of studies on the effects of different evidence properties. To bridge this gap, we design comprehensive experiments along three dimensions: styles of evidence, quantity of evidence, and the semantic similarity between external messages and the model’s internal belief. We find that instructive-style evidence demonstrates the most severe performance degradation. On the other hand, we observe a steady decline in model response quality as the amount of false evidence accumulates. Finally, we show that LLMs are more susceptible to factually incorrect evidence when their semantic similarity is close to the model’s parametric knowledge.
Anthology ID:
2026.trustnlp-main.39
Volume:
Proceedings of the 6th Workshop on Trustworthy NLP (TrustNLP 2026)
Month:
July
Year:
2026
Address:
San Diego, California
Editors:
Kai-Wei Chang, Ninareh Mehrabi, Satyapriya Krishna, Anubrata Das, Jwala Dhamala, Yang Trista Cao, Tharindu Kumarage, Anil Ramakrishna, Christos Christodoulopoulos, Yixin Wan, Aram Galystan, Anoop Kumar, Rahul Gupta
Venues:
TrustNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
528–538
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.trustnlp-main.39/
DOI:
Bibkey:
Cite (ACL):
Han-Yu Su, Kuan-Yu Chu, Yung-Hui Li, and Lun-Wei Ku. 2026. Truth or Dare: Analyzing LLM Susceptibility to External Evidence of Varying Factuality. In Proceedings of the 6th Workshop on Trustworthy NLP (TrustNLP 2026), pages 528–538, San Diego, California. Association for Computational Linguistics.
Cite (Informal):
Truth or Dare: Analyzing LLM Susceptibility to External Evidence of Varying Factuality (Su et al., TrustNLP 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.trustnlp-main.39.pdf