Large Language Models Are Overconfident in Their Own Responses

Mario Sanz-Guerrero, Manuel Mager, Katharina Von Der Wense


Abstract
Prior work has shown that instruction-tuned large language models (LLMs) are less well calibrated than their base pre-trained counterparts. However, little is known about the frequently used chat template’s effect on the calibration of conversational LLMs. In this work, we investigate the mechanisms driving this miscalibration by decoupling the effects of the post-training algorithm and the chat format. We find that, while instruction tuning fundamentally harms calibration, the chat template aggravates the issue through an “ownership bias” – models are significantly more confident in their *own* answers than in identical answers provided by a user. Extensive experiments across six recent open-weight LLMs, three benchmarks, and three confidence elicitation methods show that models assign up to 26% higher confidence to their own responses. Leveraging this insight, we propose a simple inference-time strategy: framing the model’s answer as user input during confidence elicitation. This approach significantly reduces overconfidence and improves calibration by up to 26% without the need for retraining, narrowing the gap between base and instruction-tuned models.
Anthology ID:
2026.findings-acl.1570
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
31406–31418
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1570/
DOI:
Bibkey:
Cite (ACL):
Mario Sanz-Guerrero, Manuel Mager, and Katharina Von Der Wense. 2026. Large Language Models Are Overconfident in Their Own Responses. In Findings of the Association for Computational Linguistics: ACL 2026, pages 31406–31418, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Large Language Models Are Overconfident in Their Own Responses (Sanz-Guerrero et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1570.pdf
Checklist:
 2026.findings-acl.1570.checklist.pdf