CAT: Confidence-Adaptive Thinking for Efficient Reasoning of Large Reasoning Models

Qizhi Jiang; Shuo Wang; Pei Ke; Yuhang Song; Ke Qin

CAT: Confidence-Adaptive Thinking for Efficient Reasoning of Large Reasoning Models

Qizhi Jiang, Shuo Wang, Pei Ke, Yuhang Song, Ke Qin

Abstract

Large Reasoning Models (LRMs) have achieved remarkable success on complex tasks by leveraging long chain-of-thought (CoT) trajectories, yet they frequently exhibit overthinking on simple queries, resulting in significant token overhead and reduced inference efficiency. However, existing compression methods predominantly apply uniform length reduction or rely on coarse-grained difficulty estimation, often leading to performance degradation on difficult problems. To address this limitation, we propose Confidence-Adaptive Thinking (CAT), a framework that incorporates the model’s intrinsic self-certainty signals as confidence into the preference optimization process, which autonomously modulates reasoning lengths based on problem difficulty. Experimental results show that CAT consistently outperforms state-of-the-art baselines on reasoning accuracy across multiple benchmarks on different base models. Our work enables LRMs to effectively compress confident responses while deliberating on uncertain ones, offering a potentially robust solution for balancing accuracy and latency in practical industrial scenarios.

Anthology ID:: 2026.acl-industry.152
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Yunyao Li, Georg Rehm, Mei Tu
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 2271–2283
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.acl-industry.152/
DOI:
Bibkey:
Cite (ACL):: Qizhi Jiang, Shuo Wang, Pei Ke, Yuhang Song, and Ke Qin. 2026. CAT: Confidence-Adaptive Thinking for Efficient Reasoning of Large Reasoning Models. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), pages 2271–2283, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: CAT: Confidence-Adaptive Thinking for Efficient Reasoning of Large Reasoning Models (Jiang et al., ACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.acl-industry.152.pdf

PDF Cite Search Fix data