A Probabilistic Inference Scaling Theory for LLM Self-Correction

Zhe Yang; Yichang Zhang; Yudong Wang; Ziyao Xu; Junyang Lin; Zhifang Sui (穗志方)

A Probabilistic Inference Scaling Theory for LLM Self-Correction

Zhe Yang, Yichang Zhang, Yudong Wang, Ziyao Xu, Junyang Lin, Zhifang Sui

Abstract

Large Language Models (LLMs) have demonstrated the capability to refine their generated answers through self-correction, enabling continuous performance improvement over multiple rounds. However, the mechanisms underlying how and why accuracy evolves during this iterative process remain unexplored. To fill this gap, we propose a probabilistic theory to model the dynamics of accuracy change and explain the performance improvements observed in multi-round self-correction. Through mathematical derivation, we establish that the accuracy after the t^th round of self-correction is given by: Acc_t = Upp - 𝛼^t(Upp - Acc₀),where Acc₀ denotes the initial accuracy, Upp represents the upper bound of accuracy convergence, and 𝛼 determines the rate of convergence. Based on our theory, these parameters can be calculated and the predicted accuracy curve then can be obtained through only a single round of self-correction. Extensive experiments across diverse models and datasets demonstrate that our theoretical predictions align closely with empirical accuracy curves, validating the effectiveness of the theory. Our work provides a theoretical foundation for understanding LLM self-correction, thus paving the way for further explorations.

Anthology ID:: 2025.emnlp-main.685
Volume:: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 13584–13598
Language:
URL:: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.685/
DOI:
Bibkey:
Cite (ACL):: Zhe Yang, Yichang Zhang, Yudong Wang, Ziyao Xu, Junyang Lin, and Zhifang Sui. 2025. A Probabilistic Inference Scaling Theory for LLM Self-Correction. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 13584–13598, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: A Probabilistic Inference Scaling Theory for LLM Self-Correction (Yang et al., EMNLP 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.685.pdf
Checklist:: 2025.emnlp-main.685.checklist.pdf

PDF Cite Search Checklist Fix data