When More Thinking Hurts: Overthinking in LLM Test-Time Compute Scaling

Shu Zhou, Rui Ling, Junan Chen, Xin Wang, Tao Fan, Hao Wang


Abstract
Scaling test-time compute through extended chains of thought has become a dominant paradigm for improving large language model reasoning. However, existing research implicitly assumes that longer thinking always yields better results. This assumption remains largely unexamined. We systematically investigate how the marginal utility of additional reasoning tokens changes as compute budgets increase. We find that marginal returns diminish substantially at higher budgets and that models exhibit overthinking, where extended reasoning is associated with abandoning previously correct answers. Furthermore, we show that optimal thinking length varies across problem difficulty, suggesting that uniform compute allocation is suboptimal. Our cost-aware evaluation framework reveals that stopping at moderate budgets can reduce computation significantly while maintaining comparable accuracy.
Anthology ID:
2026.findings-acl.1199
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
23967–23977
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1199/
DOI:
Bibkey:
Cite (ACL):
Shu Zhou, Rui Ling, Junan Chen, Xin Wang, Tao Fan, and Hao Wang. 2026. When More Thinking Hurts: Overthinking in LLM Test-Time Compute Scaling. In Findings of the Association for Computational Linguistics: ACL 2026, pages 23967–23977, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
When More Thinking Hurts: Overthinking in LLM Test-Time Compute Scaling (Zhou et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1199.pdf
Checklist:
 2026.findings-acl.1199.checklist.pdf