Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient Reasoning
Xinglin Wang, Shaoxiong Feng, Yiwei Li, Peiwen Yuan, Yueqi Zhang, Chuyi Tan, Boyuan Pan, Yao Hu, Kan Li
Abstract
Self-consistency (SC), a widely used decoding strategy for chain-of-thought reasoning, shows significant gains across various multi-step reasoning tasks but incurs a high cost because it samples a preset number of reasoning paths. Its variants, Adaptive Self-Consistency (ASC) and Early-Stopping Self-Consistency (ESC), dynamically adjust the number of samples based on the posterior distribution of a set of pre-samples, reducing the cost of SC with minimal impact on performance. Neither method, however, exploits prior information about question difficulty, which often results in unnecessary repeated sampling for easy questions that could be answered accurately in a single attempt, wasting resources. To tackle this problem, we propose Difficulty-Adaptive Self-Consistency (DSC), which leverages the difficulty information of batch queries from both prior and posterior perspectives to adaptively allocate inference resources, further reducing the overall cost of SC. To demonstrate the effectiveness of DSC, we conduct extensive experiments on three popular categories of reasoning tasks (arithmetic, commonsense, and symbolic reasoning) across six benchmarks. The empirical results show that DSC consistently surpasses the strong baselines ASC and ESC in cost by a significant margin while attaining comparable performance.
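The abstract describes the mechanism only at a high level: fixed-budget SC versus adaptive variants that stop sampling based on the posterior vote distribution, plus a prior difficulty signal in DSC. The sketch below is a minimal illustration of that idea, not the authors' algorithm; `sample_answer`, `prior_easy`, the window size, and the confidence threshold are all hypothetical stand-ins.

```python
from collections import Counter
from typing import Callable

def difficulty_aware_sampling(
    sample_answer: Callable[[], str],  # hypothetical: one CoT sample -> final answer string
    prior_easy: bool,                  # hypothetical prior difficulty signal for this query
    window: int = 5,                   # samples drawn per round (ESC-style window)
    max_samples: int = 40,             # preset SC budget (upper bound)
    conf_threshold: float = 0.95,      # ASC-style posterior stopping threshold
) -> str:
    """Toy sketch of difficulty-aware adaptive sampling (not the paper's exact method)."""
    if prior_easy:
        # Prior says the question is easy: spend a single sample.
        return sample_answer()

    votes = Counter()
    while sum(votes.values()) < max_samples:
        # Draw the next window of samples, never exceeding the preset budget.
        take = min(window, max_samples - sum(votes.values()))
        batch = [sample_answer() for _ in range(take)]
        votes.update(batch)
        top_answer, top_count = votes.most_common(1)[0]
        # Stop early on a unanimous window (ESC-like) or a confident posterior (ASC-like).
        if len(set(batch)) == 1 or top_count / sum(votes.values()) >= conf_threshold:
            return top_answer
    return votes.most_common(1)[0][0]

if __name__ == "__main__":
    import random
    # Fake sampler that answers "42" 80% of the time, just to exercise the loop.
    fake = lambda: random.choice(["42"] * 4 + ["41"])
    print(difficulty_aware_sampling(fake, prior_easy=False))
```

In this sketch the prior check saves the entire budget on easy queries, while the window loop mimics the posterior-based early stopping of ASC/ESC; per the abstract, the paper's contribution is combining both signals to allocate inference resources across batch queries.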
- Anthology ID:
- 2025.findings-naacl.383
- Volume:
- Findings of the Association for Computational Linguistics: NAACL 2025
- Month:
- April
- Year:
- 2025
- Address:
- Albuquerque, New Mexico
- Editors:
- Luis Chiruzzo, Alan Ritter, Lu Wang
- Venue:
- Findings
- Publisher:
- Association for Computational Linguistics
- Pages:
- 6904–6917
- URL:
- https://preview.aclanthology.org/Author-page-Marten-During-lu/2025.findings-naacl.383/
- Cite (ACL):
- Xinglin Wang, Shaoxiong Feng, Yiwei Li, Peiwen Yuan, Yueqi Zhang, Chuyi Tan, Boyuan Pan, Yao Hu, and Kan Li. 2025. Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient Reasoning. In Findings of the Association for Computational Linguistics: NAACL 2025, pages 6904–6917, Albuquerque, New Mexico. Association for Computational Linguistics.
- Cite (Informal):
- Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient Reasoning (Wang et al., Findings 2025)
- PDF:
- https://preview.aclanthology.org/Author-page-Marten-During-lu/2025.findings-naacl.383.pdf