LaCo: Layer-wise Compensation for Pruned Large Language Models

Yingen Liu; Fan Wu (吴凡, 吴钒); Panxuyan; Ruihui Li; Zhuo Tang; Kenli Li

LaCo: Layer-wise Compensation for Pruned Large Language Models

Yingen Liu, Fan Wu, Panxuyan, Ruihui Li, Zhuo Tang, Kenli Li

Abstract

Pruning is essential for the efficient deployment of Large Language Models (LLMs); however, it causes severe performance degradation due to the structural distortion induced by sparsity.Existing recovery strategies, such as LoRA, predominantly employ global fine-tuning, often overlooking the mechanistic root of this degradation: the layer-wise accumulation and amplification of local errors. To address this limitation, we propose LaCo(Layer-wise Compensation), a framework that reorients the recovery paradigm from global adaptation to hierarchical representation alignment.By sequentially optimizing each layer to reconstruct the model’s hidden states, LaCo effectively intercept the error propagation chain at its source.Extensive experiments demonstrate that LaCo surpasses parameter-efficient baselines in both perplexity reduction and zero-shot reasoning.Notably, it reduces recovery-time memory usage to approximately 1/7 of the baseline and requires only 2,048 unlabeled samples to match a LoRA model trained on 50k examples—achieving a ∼25× improvement in data efficiency.

Anthology ID:: 2026.acl-long.1342
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 29099–29113
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.1342/
DOI:
Bibkey:
Cite (ACL):: Yingen Liu, Fan Wu, Panxuyan, Ruihui Li, Zhuo Tang, and Kenli Li. 2026. LaCo: Layer-wise Compensation for Pruned Large Language Models. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 29099–29113, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: LaCo: Layer-wise Compensation for Pruned Large Language Models (Liu et al., ACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.1342.pdf
Checklist:: 2026.acl-long.1342.checklist.pdf

PDF Cite Search Checklist Fix data