Memory-Efficient Structured Backpropagation for On-Device LLM Fine-Tuning

JuneYoung Park; Yuri Hong; Seongwan Kim; Jaeho Lee

Memory-Efficient Structured Backpropagation for On-Device LLM Fine-Tuning

JuneYoung Park, Yuri Hong, Seongwan Kim, Jaeho Lee

Abstract

On-device fine-tuning enables privacy-preserving personalization of large language models, but mobile devices impose severe memory constraints, typically 6–12GB shared across all workloads. Existing approaches force a trade-off between exact gradients with high memory (MeBP) and low memory with noisy estimates (MeZO). We propose Memory-efficient Structured Backpropagation (MeSP), which bridges this gap by manually deriving backward passes that exploit LoRA’s low-rank structure. Our key insight is that the intermediate projection h = xA can be recomputed during backward at minimal cost since rank r ≪ d_in, eliminating the need to store it. MeSP achieves 49% average memory reduction compared to MeBP on Qwen2.5 models (0.5B–3B) while computing mathematically identical gradients. Our analysis also reveals that MeZO’s gradient estimates show near-zero correlation with true gradients (cosine similarity ≈0.001), explaining its slow convergence. MeSP reduces peak memory from 361MB to 136MB for Qwen2.5-0.5B, enabling fine-tuning scenarios previously infeasible on memory-constrained devices.

Anthology ID:: 2026.acl-industry.62
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Yunyao Li, Georg Rehm, Mei Tu
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 906–916
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.acl-industry.62/
DOI:
Bibkey:
Cite (ACL):: JuneYoung Park, Yuri Hong, Seongwan Kim, and Jaeho Lee. 2026. Memory-Efficient Structured Backpropagation for On-Device LLM Fine-Tuning. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), pages 906–916, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: Memory-Efficient Structured Backpropagation for On-Device LLM Fine-Tuning (Park et al., ACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.acl-industry.62.pdf

PDF Cite Search Fix data