Knowing When to Quit: Diagnosing and Training LLMs to Abort Futile Reasoning
Xinyan Guan, Jiali Zeng, Chunlei Xin, Yaojie Lu, Hongyu Lin, Xianpei Han, Le Sun, Fandong Meng
Abstract
Large language models generate computationally expensive yet semantically void reasoning on beyond-capability tasks, creating safety risks where plausible-sounding but incorrect derivations mislead users. We characterize this futile reasoning phenomenon through systematic analysis, revealing universal capability overreach and systematic miscalibration towards over-confidence. The dominant failure mode is specious reasoning, superficially valid outputs with subtle hallucinations, which escalates with task difficulty. We demonstrate that prompt engineering proves insufficient to calibrate refusal behavior. To address this, we introduce CaRL (Capability-aligned Reinforcement Learning), which aligns model behavior with capability boundaries through reward shaping that incentivizes refusal over hallucination and hindsight augmentation that converts failures into refusal supervision. Experiments demonstrate a substantial reduction in futile reasoning while preserving performance across task difficulties, effectively achieving capability-aligned behavior without sacrificing utility.- Anthology ID:
- 2026.findings-acl.830
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2026
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 16823–16835
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.830/
- DOI:
- Cite (ACL):
- Xinyan Guan, Jiali Zeng, Chunlei Xin, Yaojie Lu, Hongyu Lin, Xianpei Han, Le Sun, and Fandong Meng. 2026. Knowing When to Quit: Diagnosing and Training LLMs to Abort Futile Reasoning. In Findings of the Association for Computational Linguistics: ACL 2026, pages 16823–16835, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- Knowing When to Quit: Diagnosing and Training LLMs to Abort Futile Reasoning (Guan et al., Findings 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.830.pdf