Adaptive Platt Scaling with Causal Interpretations for Self-Reflective Language Model Uncertainty Estimates

Anthony Sicilia; Malihe Alikhani

doi:10.18653/v1/2025.findings-emnlp.999

Adaptive Platt Scaling with Causal Interpretations for Self-Reflective Language Model Uncertainty Estimates

Abstract

As large language models (LLMs) are consumed by more users and deployed in increasingly autonomous capacities, their ability to self-monitor and ask for human intervention is of vital importance. Underlying this capability are fundamental skills like self-reflection and expression of uncertainty. In this work, we provide a formal analysis of LLM self-reflection for uncertainty estimation, using domain adaptation theory to model the shift between base predictions and reflective judgments. We use this to motivate a temperature scaling algorithm that calibrates uncertainty using comparisons between base predictions and LLM self-reflections. We evaluate our approach on challenging question-answering tasks requiring reasoning, demonstrating that our methods can improve calibration of uncertainty estimates and also offer improvements in human interpretation. More broadly, this use case shows how domain adaptation presents a promising analytical tool for understanding the underlying statistical properties of LLM self-reflections.

Anthology ID:: 2025.findings-emnlp.999
Volume:: Findings of the Association for Computational Linguistics: EMNLP 2025
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 18414–18422
Language:
URL:: https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.999/
DOI:: 10.18653/v1/2025.findings-emnlp.999
Bibkey:
Cite (ACL):: Anthony Sicilia and Malihe Alikhani. 2025. Adaptive Platt Scaling with Causal Interpretations for Self-Reflective Language Model Uncertainty Estimates. In Findings of the Association for Computational Linguistics: EMNLP 2025, pages 18414–18422, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: Adaptive Platt Scaling with Causal Interpretations for Self-Reflective Language Model Uncertainty Estimates (Sicilia & Alikhani, Findings 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.999.pdf
Checklist:: 2025.findings-emnlp.999.checklist.pdf

PDF Cite Search Checklist Fix data