Optimizing Length Compression in Large Reasoning Models

Zhengxiang Cheng; Dongping Chen; Mingyang Fu; Tianyi Zhou

Optimizing Length Compression in Large Reasoning Models

Zhengxiang Cheng, Dongping Chen, Mingyang Fu, Tianyi Zhou

Abstract

Large Reasoning Models (LRMs) have achieved remarkable success, yet they often suffer from producing unnecessary and verbose reasoning chains. We identify a core aspect of this issue as ”invalid thinking”— models tend to repeatedly double-check their work after having derived the correct answer. To address this specific inefficiency, we move beyond the general principles of Efficacy and Efficiency to propose two new, fine-grained principles: Brevity, which advocates for eliminating redundancy, and Sufficiency, which ensures critical reasoning steps are preserved. Guided by these principles, we introduce LC-R1, a post-training method based on Group Relative Policy Optimization (GRPO). LC-R1 employs a novel combination of a Length Reward for overall conciseness and a Compress Reward that is specifically designed to remove the invalid portion of the thinking process. Extensive experiments on multiple reasoning benchmarks demonstrate that LC-R1 achieves a significant reduction in sequence length (5̃0%) with only a marginal (2̃%) drop in accuracy, achieving a favorable trade-off point on the Pareto frontier that prioritizes high compression. Our analysis further validates the robustness of LC-R1 and provides valuable insights for developing more powerful yet computationally efficient LRMs.

Anthology ID:: 2026.acl-long.146
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 3232–3250
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.146/
DOI:
Bibkey:
Cite (ACL):: Zhengxiang Cheng, Dongping Chen, Mingyang Fu, and Tianyi Zhou. 2026. Optimizing Length Compression in Large Reasoning Models. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 3232–3250, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Optimizing Length Compression in Large Reasoning Models (Cheng et al., ACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.146.pdf
Checklist:: 2026.acl-long.146.checklist.pdf

PDF Cite Search Checklist Fix data