AdapThink: Adaptive Thinking Preferences for Reasoning Language Models
Wenyue Xu, Xu Wan, Wei Wang, Wenqi Huang, Wotao Yin, Shengjie Zhao, Mingyang Sun
Abstract
The slow thinking paradigm has been widely validated to enhance the reasoning capabilities of Large Language Models (LLMs), but it introduces notable reasoning inefficiencies: models often overthink simple tasks while prematurely shifting their reasoning paths when addressing complex problems. To address this, we propose AdapThink, a simple yet efficient framework for adaptive reasoning preference control. Unlike methods imposing uniform length constraints, AdapThink dynamically adjusts reflection preferences based on group-level distributional statistics of reasoning length and reflection intensity. We further introduce a dispersion-based diversity sampling mechanism that maximizes the geometric spread of reasoning patterns, accelerating learning through exposure to diverse problem-solving strategies. Across mathematical reasoning and code generation benchmarks, AdapThink reduces average response length by 17.1%-21.4% while improving performance by 6.12-6.59 points under 32K token budgets, demonstrating superior efficiency and robustness against reward hacking compared to strong baselines.- Anthology ID:
- 2026.findings-acl.477
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2026
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 9808–9825
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.477/
- DOI:
- Cite (ACL):
- Wenyue Xu, Xu Wan, Wei Wang, Wenqi Huang, Wotao Yin, Shengjie Zhao, and Mingyang Sun. 2026. AdapThink: Adaptive Thinking Preferences for Reasoning Language Models. In Findings of the Association for Computational Linguistics: ACL 2026, pages 9808–9825, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- AdapThink: Adaptive Thinking Preferences for Reasoning Language Models (Xu et al., Findings 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.477.pdf