Pru-CoT: Towards Efficient Reasoning Distillation via Pruning Chain-of-Thought

Han Liu, Shuotian Ma, Hui Li, Xiaotong Zhang, Fenglong Ma, Hong Yu


Abstract
Knowledge distillation has emerged as a pivotal paradigm for transferring the superior reasoning capabilities of Large Reasoning Models (LRMs) to efficient student models. However, the raw Chain-of-Thought (CoT) trajectories are often verbose and redundant, which dilutes the underlying logic and hinders effective knowledge distillation for student models. Although recent work has focused on pruning CoT to streamline these reasoning paths, existing local heuristic methods often fail to capture global causal logic due to rigid rules and limited search spaces, while global heuristic approaches incur substantial computational costs. To address these issues, we propose Pru-CoT (Pruning Chain-of-Thought), a framework that aims to extract the essential logical structure from reasoning chains. Pru-CoT implements a step-level importance assessment via global optimization on a frozen student large language model (LLM), quantifying the gradient-based causal contribution of each component. Guided by these important signals, the framework performs fidelity-constrained pruning, utilizing an LLM-driven process to synthesize concise, logically coherent narratives. Extensive experiments on mathematical reasoning benchmarks demonstrate that models trained with Pru-CoT not only achieve superior accuracy but also generate significantly more compact reasoning paths compared to those trained on raw verbose data.
Anthology ID:
2026.findings-acl.1684
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
33744–33756
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1684/
DOI:
Bibkey:
Cite (ACL):
Han Liu, Shuotian Ma, Hui Li, Xiaotong Zhang, Fenglong Ma, and Hong Yu. 2026. Pru-CoT: Towards Efficient Reasoning Distillation via Pruning Chain-of-Thought. In Findings of the Association for Computational Linguistics: ACL 2026, pages 33744–33756, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Pru-CoT: Towards Efficient Reasoning Distillation via Pruning Chain-of-Thought (Liu et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1684.pdf
Checklist:
 2026.findings-acl.1684.checklist.pdf