Reasoning about Uncertainty: Do Reasoning Models Know When They Don’t Know?
Zhiting Mei, Christina Zhang, Tenny Yin, Justin Lidard, Ola Sho, Anirudha Majumdar
Abstract
Reasoning language models have set state-of-the-art (SOTA) records on many challenging benchmarks, enabled by multi-step reasoning induced by reinforcement learning. However, reasoning models are prone to generating confident, plausible responses that are incorrect (hallucinations). Knowing when and how much to trust these models is critical for safe deployment in real-world applications. To this end, we explore uncertainty quantification (UQ) of reasoning models in this work. We ask three fundamental questions: First, are reasoning models well-calibrated? Second, does deeper reasoning improve model calibration? Finally, inspired by humans’ innate ability to double-check their thought processes to verify the validity of their answers and their confidence, we ask: can reasoning models improve their calibration by explicitly reasoning about their chain-of-thought traces? We introduce introspective uncertainty quantification (IUQ) to explore this direction. In extensive evaluations on SOTA reasoning models across a broad range of benchmarks focused on knowledge-intensive tasks, we find that reasoning models: (i) are typically overconfident, (ii) become even more overconfident with deeper reasoning, and (iii) can become better calibrated through introspection (e.g., o3-Mini and DeepSeek R1) but not uniformly (e.g., Claude 3.7 Sonnet becomes more poorly calibrated). We conclude with important research directions to design necessary UQ benchmarks and improve the calibration of reasoning models.
- Anthology ID: 2026.findings-eacl.178
- Volume: Findings of the Association for Computational Linguistics: EACL 2026
- Month: March
- Year: 2026
- Address: Rabat, Morocco
- Editors: Vera Demberg, Kentaro Inui, Lluís Marquez
- Venue: Findings
- Publisher: Association for Computational Linguistics
- Pages: 3408–3458
- URL: https://preview.aclanthology.org/ingest-eacl/2026.findings-eacl.178/
- Cite (ACL): Zhiting Mei, Christina Zhang, Tenny Yin, Justin Lidard, Ola Sho, and Anirudha Majumdar. 2026. Reasoning about Uncertainty: Do Reasoning Models Know When They Don’t Know?. In Findings of the Association for Computational Linguistics: EACL 2026, pages 3408–3458, Rabat, Morocco. Association for Computational Linguistics.
- Cite (Informal): Reasoning about Uncertainty: Do Reasoning Models Know When They Don’t Know? (Mei et al., Findings 2026)
- PDF: https://preview.aclanthology.org/ingest-eacl/2026.findings-eacl.178.pdf