SelfBudgeter: Adaptive Token Allocation for Efficient LLM Reasoning

Zheng Li; Qingxiu Dong; Jingyuan Ma; Di Zhang; Kai Jia; Zhifang Sui

SelfBudgeter: Adaptive Token Allocation for Efficient LLM Reasoning

Zheng Li, Qingxiu Dong, Jingyuan Ma, Di Zhang, Kai Jia, Zhifang Sui

Abstract

Recently, large reasoning models demonstrate exceptional performance on various tasks. However, reasoning models always consume excessive tokens even for simple queries, leading to resource waste and prolonged user latency. To address this challenge, we propose SelfBudgeter - a self-adaptive reasoning strategy for efficient and controllable reasoning. Specifically, we first train the model to self-estimate the required reasoning budget based on the query. We then introduce budget-guided GRPO for reinforcement learning, which effectively maintains accuracy while reducing output length. Experimental results demonstrate that SelfBudgeter dynamically allocates budgets according to problem complexity, achieving an average response length compression of 61% on math reasoning tasks while maintaining accuracy. Furthermore, SelfBudgeter allows users to see how long generation will take and decide whether to continue or stop. Additionally, users can directly control the reasoning length by setting token budgets upfront.

Anthology ID:: 2026.findings-acl.1063
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 21135–21156
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1063/
DOI:
Bibkey:
Cite (ACL):: Zheng Li, Qingxiu Dong, Jingyuan Ma, Di Zhang, Kai Jia, and Zhifang Sui. 2026. SelfBudgeter: Adaptive Token Allocation for Efficient LLM Reasoning. In Findings of the Association for Computational Linguistics: ACL 2026, pages 21135–21156, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: SelfBudgeter: Adaptive Token Allocation for Efficient LLM Reasoning (Li et al., Findings 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1063.pdf
Checklist:: 2026.findings-acl.1063.checklist.pdf

PDF Cite Search Checklist Fix data